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C3B/C4B COMPLEMENT RECEPTOR-LIKE MOLECULES 
AND USES THEREOF 

-5* This application claims the benefit of U.S. 

Provisional Application No. 60/222,504, filed August 2, 
2000 and U.S. Application No. 09/728,787 filed November 
^2 8, 2000, which are hereby incorporated by reference. 

10 Field of the Invention 

The present invention relates to novel C3b/C4b 
Complement Receptor- like polypeptides and nucleic acid 
molecules encoding the same. The invention also 
relates to vectors, host cells, pharmaceutical 
15 compositions, selective binding agents and methods for 
producing C3b/C4b Complement Receptor- like 

polypeptides. Also provided for are methods for the 
diagnosis, treatment, amelioration, and/or prevention 
of diseases associated with C3b/C4b Complement 

2 0 Receptor- like polypeptides. 

Background of the Invention 

Technical advances in the identification, cloning, 
expression and manipulation of nucleic acid molecules 
25 and the deciphering of the human genome have greatly 
accelerated the discovery of novel therapeutics. Rapid 
nucleic acid sequencing techniques can now generate 
sequence information at unprecedented rates and, 
coupled with computational analyses, allow the assembly 

3 0 of overlapping sequences into partial and entire 

genomes and the identification of polypeptide -encoding 
regions. A comparison of a predicted amino acid 
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'sequence against a database compilation of known amino 
acid sequences allows one to determine the extent of 
homology to previously identified sequences and/or 
'structural landmarks. The cloning and expression of a 

"5 polypeptide- encoding region of a nucleic acid molecule 
provides a polypeptide product for structural and 
functional analyses. The manipulation of nucleic acid 
molecules and encoded polypeptides may confer 
advantageous properties on a product for use as a 

10 therapeutic. 

Despite the significant technical advances in 
genome research over the past decade , the potential for 
the development of novel therapeutics based on the 
human genome is still largely unrealized. Many genes 
15 encoding potentially beneficial polypeptide 

therapeutics, or those encoding polypeptides, which may 
act as "targets" for therapeutic molecules, have still 
not been identified. 

Accordingly, it is an object of the invention to 
20 identify novel polypeptides and nucleic acid molecules 
encoding the same, which have diagnostic or therapeutic 
benefit. 

Summary of the Invention 

25 The present invention relates to novel C3b/C4b 

Complement Receptor-like nucleic acid molecules and 
encoded polypeptides. 

The invention provides for an isolated nucleic 
acid molecule comprising a nucleotide sequence selected 
30 from the group consisting of: 

(a) the nucleotide sequence as set forth in SEQ 
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ID NO : 1 , SEQ ID NO : 3 , or SEQ ID NO:6; 

(b) a nucleotide sequence encoding the 
polypeptide as set forth in SEQ ID NO:2, SEQ ID NO: 4, 
or SEQ ID NO:7; 

5 (c) a nucleotide sequence which hybridizes under 

moderately or highly stringent conditions to the 
complement of (a) or (b) , wherein the encoded 
polypeptide has an activity of the polypeptide as set 
forth in SEQ ID NO:2, SEQ ID NO: 4 , or SEQ ID NO: 7; and 

10 (d) a nucleotide sequence complementary to any of 

(a) -(c). 

The invention also provides for an isolated 
nucleic acid molecule comprising a nucleotide sequence 
15 selected from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide 
that is at least about 70, 75, 80, 85, 90, 95, 96, 97, 
98, or 99 percent identical to the polypeptide as set 
forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7, 

20 wherein the polypeptide has a!n activity of the 
polypeptide as set forth in SEQ ID NO:2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(b) a nucleotide sequence encoding an allelic 
variant or splice variant of the nucleotide sequence as 

25 set forth in SEQ ID NO:l, SEQ ID NO: 3, or SEQ ID NO: 6, 
wherein the encoded polypeptide has an activity of the 
polypeptide as set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(c) a nucleotide sequence of SEQ ID NO:l, SEQ ID 
30 NO:3, or SEQ ID NO : 6 , (a), or (b) encoding a 

polypeptide fragment of at least about 25 amino acid 
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'residues, wherein the polypeptide has an activity of 
the polypeptide as set forth in SEQ ID NO: 2, SEQ ID 
NO:4, or SEQ ID NO: 7; 

(d) a nucleotide sequence of SEQ ID NO:l, SEQ ID 
5 NO:3, or SEQ ID NO : 6 ,' or (a) - (c) comprising a fragment 

of at least about 16 nucleotides; 

(e) a nucleotide sequence which hybridizes under 
moderately or highly stringent conditions to the 
complement of any of (a) - (d) , wherein the polypeptide 

10 has an activity of the polypeptide as set forth in SEQ 
ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7; and 

(f) a nucleotide sequence complementary to any of 
(a) -(e) . 

15 The invention further • provides for an isolated 

nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide 
as set forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 

2 0 NO: 7, with at least one conservative amino acid 
substitution, wherein the polypeptide has an activity 
of the polypeptide as set forth in SEQ ID NO:2, SEQ ID 
NO:4, or SEQ ID NO:7; 

(b) a nucleotide sequence encoding a polypeptide 

2 5 as set forth in SEQ ID NO: 2, SEQ ID N0:4, or SEQ ID 

NO: 7 with at least one amino acid insertion, wherein 
the polypeptide has an activity of the polypeptide as 
set forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7; 

(c) a nucleotide sequence encoding a polypeptide 

3 0 as set forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 

NO: 7 with at least one amino acid deletion, wherein the 
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polypeptide has an activity of the polypeptide as set 
forth in SEQ ID 1*0:2, SEQ ID 1*0:4, or SEQ ID N0:7; 

(d) a nucleotide sequence encoding a polypeptide 
as set forth in SEQ ID NO: 2, SEQ ID NO : 4 , or SEQ ID 

5 NO:7 which* has a C- and/or N- terminal truncation, 
wherein the . polypeptide has an activity of the 
polypeptide as set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(e) a nucleotide sequence encoding a polypeptide 
10 as set forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 

NO: 7 with at least one modification selected from the 
group consisting of amino acid substitutions, amino 
acid insertions, amino acid deletions, C- terminal 
truncation, and N-terminal truncation, wherein the 
15 polypeptide has an activity of the polypeptide as set . 
forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO:7; 

(f ) a nucleotide sequence of (a) - (e) comprising a 
fragment of at least about 16 nucleotides ; 

(g) a nucleotide sequence which hybridizes under 
20 moderately or highly stringent conditions to the 

complement of any of (a) - (f ) , wherein the polypeptide 
has an activity of the polypeptide as set forth in SEQ 
ID NO:2, SEQ ID NO:4, or SEQ ID NO:7; and 

(h) a nucleotide sequence complement ary to any of 
25 (a) - (e). 

The invention also provides for an isolated 
polypeptide comprising the amino acid sequence selected 
from the group consisting of: 

(a) an amino acid sequence of the mature C3b/C4b 
30 Complement Receptor- like polypeptide wherein the 
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polypeptide comprises the amino acid sequence contained 
within SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO: 7, and 
optionally further comprises an amino- terminal 
methionine; 

5 (b) an amino acid sequence for an ortholog of SEQ 

ID NO: 2, SEQ ID NO:4, or SEQ ID NO: 7, wherein the 
encoded polypeptide has an activity of the polypeptide 
as set forth, in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 
NO : 7 ; 

10 (c) an amino acid sequence that is at least about 

70, 80,. 85, 90, 95, 96, 97, 98, or 99 percent identical 
to the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4 , 
or SEQ ID NO: 7, wherein the polypeptide has an activity 
of the polypeptide as set forth in SEQ ID NO:2, SEQ ID 

15 NO:4, or SEQ ID NO: 7; 

(d) a fragment of the amino acid sequence set 
forth in SEQ ID NO: 2, SEQ ID NO:4, or SEQ ID NO: 7 
comprising at least about 25 amino acid residues, 
wherein the polypeptide has an activity of the 

2 0 polypeptide as set forth in SEQ ID NO: 2, SEQ ID NO:4, 

or SEQ ID NO: 7; 

(e) an amino acid sequence for an allelic variant 
or splice variant of either the amino acid sequence as 
set forth in SEQ ID NO:2, SEQ ID NO: 4, or SEQ ID NO: 7, 

25 or at least .one of (a) - (c) wherein the polypeptide has 
an activity of the polypeptide as set forth in SEQ ID 
NO:2, SEQ ID NO: 4, or SEQ ID NO : 7 . 

The invention further provides for an isolated 

3 0 polypeptide comprising the amino acid sequence selected 

from the. group consisting of: 
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(a) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO:4, or SEQ ID NO: 7 with at least one 
conservative amino acid substitution, wherein the 

♦ polypeptide has an activity of the polypeptide as set 
• 5 forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7; 

(b) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO : 4 , or SEQ ID NO : 7 with at least one 
amino acid insertion, wherein the polypeptide has an 
activity of the polypeptide as set forth in SEQ ID 

10 NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7; 

(c) the amino acid sequence as set forth in SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:7 with at least one 
amino acid deletion, wherein the polypeptide has an 
activity of the polypeptide as set forth in SEQ ID 

15 NO:2, SEQ ID NO : 4 , or SEQ ID NO:7; 

(d) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7 which has a C- and/or 
N-terminal truncation, wherein the polypeptide has an 
activity of the polypeptide as set forth in SEQ ID 

20 NO: 2, SEQ ID NO:4, or SEQ ID NO: 7; and 

(e) the amino acid sequence as set forth in SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO: 7, with at least one 
modification selected from the group consisting of 
amino acid substitutions, amino acid insertions, amino 

25 acid deletions, C- terminal truncation, and N-terminal 
truncation, wherein the polypeptide has an activity of 
the polypeptide as set forth in SEQ ID NO:2, SEQ ID 
NO: 4, or SEQ ID NO: 7. 

30 Also provided are fusion polypeptides comprising 

the amino acid sequences of (a) - (e) above. 
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: ~* ; The present invention also provides for an 

expression vector comprising the isolated nucleic acid 
molecules as set forth herein, recombinant host cells 
•comprising recombinant nucleic acid molecules as set 
•5 forth herein, and a method of producing a C3b/C4b 
Complement Receptor- like polypeptide comprising 
culturing the host cells and optionally isolating the 
polypeptide so produced. 

A transgenic non-human animal comprising a nucleic 
10 acid molecule encoding a C3b/C4b Complement Receptor- 
like polypeptide is also encompassed by the invention. 
The C3b/C4b Complement Receptor-like nucleic acid 
molecules are introduced into the animal in a manner 
that allows expression and increased levels of the 
15 C3b/C4b Complement Receptor-like polypeptide, which may 
include increased circulating levels. The transgenic 
non-human animal is preferably a mammal . 

Also provided are derivatives of the C3b/C4b 
Complement Receptor- like polypeptides of the present 
20 invention. 

Additionally provided are selective binding- agents 
such as antibodies and peptides capable of specifically 
binding the C3b/C4b Complement Receptor- like 
polypeptides of the invention. Such antibodies and 
25 peptides may be agonistic or antagonistic. 

Pharmaceutical compositions comprising the 
nucleotides, polypeptides, or selective binding agents 
of the present invention and one or more 
pharmaceutically acceptable formulation agents are also 
30 encompassed by the invention. The pharmaceutical 
compositions are used to provide therapeutically 
effective amounts of the nucleotides or polypeptides of 
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the present invention. The invention is also directed 
to methods of using the polypeptides, nucleic acid 
molecules, and selective binding agents. 

The C3b/C4b Complement Receptor-like polypeptides 
5 and nucleic acid molecules of the present invention may 

Hp* n<=Jf=*H t~ o hroaf nrpvpnt- ^ mp 1 inraf-o anH/pr r}*»t- 

— — — — — — *~ i xr — — " — ~— — t — »» , • — ~- ^ *r ■ — | v- . >-* w w x_ »_ 

diseases and disorders, including those recited herein. 

The present invention also provides a method of 
assaying test molecules to identify a test molecule 

10 which binds to a C3b/C4b Complement Receptor- like 
polypeptide. The method comprises contacting a C3b/C4b 
Complement Receptor- like polypeptide with a test 
molecule and determining -the extent of binding of the 
test molecule to the polypeptide. The method further 

15 comprises determining whether such test molecules are 
agonists or antagonists of a C3b/C4b Complement 
Receptor-like polypeptide. The present invention 

further provides a method of testing the impact of 
molecules on the expression of C3b/C4b Complement 

2 0 Receptor-like polypeptide or on the activity of C3b/C4b 

Complement Receptor-like polypeptide. 

Methods of regulating expression and modulating 
(i.e., increasing or decreasing) levels of a C3b/C4b 
Complement Receptor-like polypeptide are also 
25 encompassed by the invention. One method comprises 
administering to an animal a nucleic acid molecule 
encoding a C3b/C4b Complement Receptor-like 
polypeptide. In another method, a nucleic acid 

molecule comprising elements that regulate or modulate 

3 0 the expression of a C3b/C4b Complement Receptor-like 

polypeptide may be administered. Examples of these 



9 



WO 02/10199 



PCT/US01/23232 



methods include gene therapy, cell therapy, and anti- 
sense therapy as further described herein. 

The C3b/C4b Complement Receptor-like polypeptide 
can be used for identifying ligands thereof. Various 
5 forms of "expression cloning" have been used for 
cloning ligands for receptors. See e.gr., Davis et al . , 
Cell, 87:1161-1169 (1996). These and other C3b/C4b 
Complement Receptor-like ligand cloning experiments are 
described in greater detail herein. Isolation of the 

10 C3b/C4b Complement Receptor-like ligand (s) allows for 
the identification or development of novel agonists 
and/or antagonists of the C3b/C4b Complement Receptor- 
like signaling pathway. Such agonists and antagonists 
include C3b/C4b Complement Receptor-like ligand (s), 

15 anti-C3b/C4b Complement Receptor-like ligand antibodies 
and derivatives thereof, small molecules, or antisense 
oligonucleotides, any of which can be used for 
potentially treating one or more diseases or disorders, 
including those recited herein. 

2 0 Brief Description of the Figures 

Figure 1 depicts a nucleic acid sequence (SEQ ID 
NO:l) encoding human C3b/C4b Complement Receptor-like 
polypeptide. Also depicted is the amino acid sequence 
(SEQ ID NO: 2) of human C3B/C4b Complement Receptor - 
25 like polypeptide. 

Figure 2 depicts a nucleic acid sequence (SEQ ID 
NO: 6) encoding a second human C3b/C4b Complement 
Receptor-like polypeptide. Also depicted is the amino 
acid sequence (SEQ ID NO: 7) of human C3B/C4b 
30 Complement Receptor- like polypeptide. 
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Figure 3 depicts a nucleic acid sequence (SEQ ID 
NO: 3) encoding rat C3b/C4b Complement Receptor- like 
polypeptide. Also depicted is the amino acid sequence 
'of rat C3b/C4b Complement Receptor-like polypeptide 
-5 (SEQ ID NO: 4) . 

Figure 4 depicts an amino acid comparison of a 
known human C3b/C4b Complement Receptor (SEQ ID NO: 5) 
and the human AGP-41773 (SEQ ID NO: 2) . 

10 Detailed Description of the Invention 

The section headings used herein are for 
organizational purposes only and are not to be 
construed as limiting the subject matter described. 
All references cited in this application are expressly 
15 incorporated by reference herein. 

Definitions 

The term n C3b/C4b Complement Receptor- like" is 
abbreviated herein as "C3b/C4b CR-like" and is also 
20 referred to as "AGP-41773 " . The terms "C3b/C4b CR-like 
gene" or "C3b/C4b CR-like nucleic acid molecule" or 
."polynucleotide" refers to a nucleic acid molecule 
comprising or consisting of a nucleotide sequence as 
set forth in SEQ ID NO:l, SEQ ID NO: 3, or SEQ ID NO: 6, 

2 5 a nucleotide, sequence encoding the polypeptide as set 

forth in SEQ ID NO: 2, SEQ ID N0:4, or SEQ ID NO: 7, and 
nucleic acid molecules as defined herein. 

The term M C3b/C4b CR-like polypeptide" refers to a 
polypeptide comprising the amino acid sequence of SEQ 

3 0 ID NO:2, SEQ ID NO : 4 , or SEQ ID NO: 7, and related 

polypeptides. Related polypeptides include: C3b/C4b 
CR-like polypeptide allelic variants, C3b/C4b CR-like 
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polypeptide orthologs, C3b/C4b CR-like polypeptide 
splice variants, C3b/C4b CR-like polypeptide variants 
and C3b/C4b CR-like polypeptide derivatives. C3b/C4b 
CR-like polypeptides may be mature polypeptides, as 
5 defined . herein, and may or may not have an amino 
terminal methionine residue, depending on the method by 
which they are prepared. 

The term M C3b/C4b CR-like polypeptide allelic 
variant" refers to one of several possible naturally 
10 occurring alternate forms of a gene occupying a given 
locus on a chromosome of an organism or a population of 
organisms. 

The term "C3b/C4b CR-like polypeptide derivatives" 
refers to the polypeptide as set forth in SEQ ID NO:2, 

15 SEQ IB NOi4, or SEQ ID NO ; 7 , C3b/C4b CR-like 
polypeptide allelic variants, C3b/C4b CR-like 
polypeptide orthologs, C3b/C4b CR-like polypeptide 
splice variants, or C3b/C4b CR-like polypeptide 
variants, as defined herein, that have been chemically 

20 modified. 

The term l! C3b/C4b CR-like polypeptide fragment" 
refers to a polypeptide that comprises a truncation at 
the amino terminus (with or without a leader sequence) 
and/or a truncation at the carboxy terminus of the 
polypeptide as set forth in SEQ ID NO: 2, SEQ ID NO:4, 
or SEQ ID NO: 7, C3b/C4b CR-like polypeptide allelic 
variants, C3b/C4b CR-like polypeptide orthologs, 
C3b/C4b CR-like polypeptide splice variants and/or a 
C3b/C4b CR-like polypeptide variant having one or more 
amino acid additions or substitutions or internal 
deletions (wherein the resulting polypeptide is at 



25 



30 
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'least 6 amino acids or more in length) as compared to 
the C3b/C4b CR-like polypeptide amino acid sequence set 
forth in SEQ ID NO:2, SEQ ID NO: 4, or SEQ ID NO: 7* 
<C3b/C4b CR-like polypeptide fragments may result from 

-5 alternative RNA splicing or from in vivo protease 
activity. For transmembrane or membrane -bound forms of 
a C3b/C4b CR-like polypeptide, preferred fragments 
^include soluble forms such as those lacking a 
transmembrane or membrane -binding domain. In preferred 

10 embodiments, truncations comprise about 10 amino acids, 
or about 20 amino acids, or about 50 amino acids, or 
about 75 amino acids, or about 100 amino acids, or more 
than about 100 amino acids. The polypeptide fragments 
so produced will comprise about 25 contiguous amino. 

15 acids, or about 50 amino acids, or about 75 amino 
acids, or about 100 amine acids, or about 150 amino . 
acids, or about 200 amino acids. Such C3b/C4b CR-like 
polypeptide fragments may optionally comprise an amino 
terminal methionine residue. It will be appreciated 

20 that such fragments can be used, for example, to 
generate antibodies to C3b/C4b CR-like polypeptides. 

The term "C3b/C4b CR-like fusion polypeptide" 
refers to a fusion of one or more amino acids (such as 
a heterologous peptide or polypeptide) at the amino or 

25 carboxy terminus of the polypeptide as set forth in SEQ 
ID NO: 2, SEQ ID N0:4, or SEQ ID NO: 7, C3b/C4b CR-like 
polypeptide allelic variants, C3b/C4b CR-like 
polypeptide orthologs, C3b/C4b CR-like polypeptide 
splice variants, or C3b/C4b CR-like polypeptide 

30 variants having one or more amino acid deletions, 
substitutions or internal additions as compared to the 
C3b/C4b CR-like polypeptide amino acid sequence set 
forth in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:7. 
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The term "C3b/C4b CR-like polypeptide ortholog" 
refers to a polypeptide from another species that 
corresponds to C3b/C4b CR-like polypeptide amino acid 
'sequence as set forth in SEQ ID NO: 2, SEQ ID NO: 4, or 
*"5 SEQ ID NO: 7. For example, mouse and human C3b/C4b CR- 
like polypeptides are considered orthologs of each 
other. 

The term n C3b/C4b CR-like polypeptide splice 
variant" refers to a nucleic acid molecule, usually 
10 RNA, which is generated by alternative processing of 
intron sequences in an RNA transcript of C3b/C4b CR- 
like polypeptide amino acid sequence as set forth in 
SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO : 7 . 

The term "C3b/C4b CR-like polypeptide variants" 

15 refers to C3b/C4b CR-like polypeptides comprising amino . 
acid sequences having one or more amino acid sequence 
substitutions , deletions* (such as internal deletions 
and/or C3b/C4b CR-like polypeptide fragments), and/ ox 
additions (such as internal additions and/or C3b/C4b 

20 CR-like fusion polypeptides) as compared to the C3b/C4b 
CR-like polypeptide amino acid sequence set forth in 
SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7 (with or 
without a leader sequence) . Variants may be naturally 
occurring {e.g., C3b/C4b CR-like polypeptide allelic 

25 variants, C3b/C4b CR-like polypeptide orthologs and 
C3b/C4b CR-like polypeptide splice variants) or 
artificially constructed. Such C3b/C4b CR-like 

polypeptide variants may be prepared from the 
corresponding nucleic acid molecules having a DNA 

30 sequence that varies accordingly from the DNA sequence 
as set forth in SEQ ID NO:l, SEQ ID NO: 3, or SEQ ID 
NO: 6. In preferred embodiments, the variants have from 
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1 to 3, or from 1 to 5, or from 1 to 10, or from 1 to 
15, or from 1 to 20, or from 1 to 25, or from 1 to 50, 
or from 1 to 75, or from 1 to 100, or more than 100 
amino acid substitutions, insertions, additions and/or 
5 deletions, wherein the substitutions may be 
conservative, or non-conservative, or any combination 
thereof . 

The term "antigen" refers to a molecule or a 
port i on of a mol ecul e capabl e of be ing bound by a 
10 selective binding agent, such as an antibody, and 
additionally capable of being used in an animal to 
produce antibodies • capable of binding to an epitope of 
that antigen. An antigen may have one or more 
epitopes . 

15. The term "biologically active C3b/C4b CR-like 

polypeptides" refers to • C3b/C4b CR-like polypeptides 
having at least one activity characteristic of the 
polypeptide comprising the amino acid . sequence of SEQ 
ID NO:2, SEQ ID NO : 4 , or SEQ ID NO: 7. 

20 The terms "effective amount" and "therapeutically 

effective amount" each refer to the amount of a C3b/C4b 
CR-like polypeptide or C3b/C4b CR-like nucleic acid 
molecule used to support an observable level of one or 
more biological activities of the C3b/C4b CR-like 

25 polypeptides as set forth herein. 

The term "expression vector" refers to a vector 
which is suitable for use in a host cell and contains 
nucleic acid sequences which direct and/or control the 
expression of heterologous nucleic acid sequences. 
30 Expression includes, but is not limited to, processes 
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such as transcription, translation, and RNA splicing, 
if introns are present. 

The term "host cell" is used to refer to a cell 
which has been transformed, or is capable of being 
5 transformed with a nucleic acid sequence and then of 
expressing a selected gene of interest. The term 
includes the progeny of the parent cell, whether or not 
the progeny is identical in morphology or in genetic 
make-up to the original parent, so long as the selected 
10 gene is present. 

The term "identity" as known in the art, refers to 
a relationship between the sequences of two or more 
polypeptide molecules or two or more nucleic acid 
molecules, as determined by comparing the sequences. 

15 In the art, "identity"* also means the degree of 
sequence relatedness between nucleic acid molecules or 
polypeptides, as the case may be, as determined by the 
match between strings of two or more nucleotide or two ■ 
or more amino acid sequences. "Identity" measures the 

20 percent of identical matches between the smaller of two 
or more sequences with gap alignments (if any) 
addressed by a particular mathematical model or 
computer program (i.e., "algorithms" ) . 

The term "similarity" is a related concept, but in 
25 contrast to "identity", refers to a measure of 
similarity which includes both identical matches and 
conservative substitution matches. If two polypeptide 
sequences have, for example, 10/20 identical amino 
acids, and the remainder are all non- conservative 
3 0 substitutions, then the percent identity and similarity 
would both be 50%. If in the same example, there are 5 
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'more positions where there are conservative 
substitutions, then the percent identity remains 50%, 
but the per cent similarity would be 75% (15/20) . 
'Therefore, in cases where there are conservative 
'-5 substitutions, the degree of similarity between two 
polypeptides will be higher than the percent identity 
between those two polypeptides. 

The term "isolated nucleic acid molecule" refers 
to a nucleic acid molecule of the invention that (1) 

10 has been separated from at least about 50 percent of 
proteins, lipids, carbohydrates or other materials with 
which it is naturally found when total DNA is isolated 
from the source cells, (2) is not linked to all or a 
portion of a polynucleotide to which the "isolated 

15 nucleic acid molecule" is linked in nature. (3) is 
operably linked to a polynucleotide which it is not 
linked to in nature, or (4) does not occur in nature as 
part of a larger polynucleotide sequence. Preferably, 
the isolated nucleic acid molecule of the present 

20 invention is substantially free from any other 
contaminating nucleic acid molecule (s) or other 
contaminants that are found in its natural environment 
that would interfere with its use in polypeptide 
production or its therapeutic, diagnostic, prophylactic 

25 or research use. 

The term "isolated polypeptide" refers to a 
polypeptide of the present invention that (1) has been 
separated from at least about 50 percent of 
polynucleotides, lipids, carbohydrates or other 
30 materials with which it is naturally found when 
isolated from the source cell, (2) is not linked (by 
covalent or noncovalent interaction) to all or a 
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* portion of a polypeptide to which the "isolated 
polypeptide" is linked in nature, (3) is operably 
linked (by covalent or noncovalent interaction) to a 
•polypeptide with which it is not linked in nature, or 
-5 (4) does not occur in nature. Preferably, the isolated 
polypeptide is substantially free from any other 
contaminating polypeptides or other contaminants that 
are found in its natural environment that would 
interfere with its therapeutic, diagnostic, 
10 prophylactic or research use. 

The term "mature C3b/C4b CR-like polypeptide" 
refers to a C3b/C4b CR-like polypeptide lacking a 
leader sequence. A mature C3b/C4b CR-like polypeptide 
may also include other modifications such as 
15 proteolytic processing of the amino terminus (with or 
without a leader sequence) and/or the carboxy terminus, 
cleavage of a smaller polypeptide from a larger 
precursor, N- linked and/or O- linked glycosylation, and 
the like. 

2 0 The term "nucleic acid sequence" or "nucleic acid 

molecule" refers to a DNA or RNA sequence. The term 
encompasses molecules formed from any of the known base 
analogs of DNA and RNA such as, but not limited to 4- 
acetylcytosine , 8 -hydroxy-N6 -methyl adenosine , 

25 aziridinyl-cytosine, pseudoisocytosine, 5- 

(carboxyhydroxylmethyl) uracil, 5-f luorouracil , 5- 
bromouracil , 5-carboxymethylaminomethyl-2-thiouracil , 
5-carboxy-methylaminomethyluracil , dihydrouracil , 

inosine , N6 - iso-pent enyladenine , 1 -methyladenine , 1 - 

3 0 methylpseudouracil , 1 -methylguanine , 1 -methyl inosine , 

2 , 2 -dimethyl -guanine, 2 -methyladenine , 2 -methylguanine, 
3 -methyl cytosine, 5 -methyl cytos ine , N6 -methyladenine, 
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7-methylguanine, 5-methylaminomethyluracil , 5- 

me t hoxy ami no - me t hy 1 - 2 - 1 hi our ac i 1 , be t a - D - 

mannosylqueosine, 5 1 -methoxycarbonyl -methyluracil , 5- 
methoxyuracii , 2 -methylthio-N6 - isopentenyladenine , 

5 uracil- 5 -oxyace tic acid methylester, uracil- 5 -oxyacetic 
acid, oxybutoxosine, pseudouracil , queosine, 2- 
thiocyt osine , 5~methyl - 2 - thiouracil , 2 - thiouracil , 4 - 
thiouracil , 5 -methyluracil , N-uracil -5-oxyacetic acid 
methylester, uracil-5-oxyacetic acid, pseudouracil, 
10 queosine, 2-thiocytosine, and 2 , 6-diaminopurine . 

The term "naturally occurring" or "native" when 
used in connection with biological materials such as 
nucleic acid molecules, polypeptides, host cells, and 
the like, refers to materials which are found in nature 
15 and are not manipulated by man. Similarly, "non- 
naturally occurring" or "non-native" as used herein 
refers to a material that is not found in nature or 
that has been structurally modified or synthesized by 
man. 

20 The term "operably linked" is used herein to refer 

to an arrangement of flanking sequences wherein the 
flanking sequences so described are configured or 
assembled so as to perform their usual function. Thus, 
a flanking sequence operably linked to a coding 

25 sequence may- be capable of effecting the replication, 
transcription and/or translation of the coding 
sequence. For example, a coding sequence is operably 
linked to a promoter when the promoter is capable of 
directing transcription of that coding sequence. A 

3 0 flanking sequence need not be contiguous with the 
coding sequence, so long as it functions correctly. 
Thus, for example, intervening untranslated . yet 
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'transcribed sequences can be present between a promoter 
sequence and the coding sequence and the promoter 
sequence can still be considered "operably linked" to 
' the coding sequence • 

5 The term "pharmaceutical^ acceptable carrier" or 

"physiologically acceptable carrier" as used herein 
, refers to one or more formulation materials suitable 
for accomplishing or enhancing the delivery of the 
C3b/C4b CR-like polypeptide, C3b/C4b CR-like nucleic 
10 acid molecule or C3b/C4b CR-like selective binding 
agent, as a pharmaceutical composition. 

The term "selective binding agent" refers to a 
molecule or molecules having specificity for a C3B/C4B 
CR-like polypeptide. As used herein, the terms, 

15 "specific" and "specificity" refer to the ability of 
the selective binding agents to bind to human C3b/C4b 
CR-like polypeptides and not to bind to human non- 
C3b/C4b CR-like polypeptides. It will be appreciated, 
however, that the selective binding agents may also 

20 bind orthologs of the polypeptide as set forth in SEQ 
ID NO:2, SEQ ID NO:4, or SEQ ID NO: 7, that is, 
interspecies versions thereof, such as mouse and rat 
polypeptides . 

The term "transduction" is used to refer to the 
25 transfer of genes from one bacterium to another, 
usually by a phage. "Transduction" also refers to the 
acquisition and transfer of eukaryotic cellular 
sequences by retroviruses. 

The term "transf ection" is used to refer to the 
3 0 uptake of foreign or exogenous DNA by a cell, and a 
cell has been "transf ected" when the exogenous DNA has 
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been introduced inside the cell membrane. A number of 
transfection techniques are well known in the art and 
are disclosed herein. See, for example, Graham et al., 
Virology, 52:456 (1973); Sambrook et al . , Molecular 
5 Cloning,. a laboratory Manual, Cold Spring Harbor 
Laboratories (New York, 1989); Davis et al., Basic 
Methods in Molecular Biology, Elsevier, 1986; and Chu * 
et al., Gene, 13^:197 (1981). Such techniques can be 
used to introduce one or more exogenous DNA moieties 
10 into suitable host cells. 

The' term "transformation" as used herein refers to 
a change in a cell's genetic characteristics, and a 
cell has been transformed when it has been modified to 
contain a new DNA. For example, a cell is transformed 

15 where it is genetically modified from its native state. 
Following transfection or transduction, the 
transforming DNA may recombine with that of the cell by -■ 
physically integrating into a chromosome of the cell, 
may be maintained transiently as an episomal element 

20 without being . replicated, or may replicate 
independently as a plasmid. A cell is considered to 
have been stably transformed when the DNA is replicated 
with the division of the cell. 

The term "vector" is used to refer to any molecule 
25 (e.g., nucleic acid, plasmid, or virus) used to 
transfer coding information to a host cell. 

Relatedness of Nucleic Acid Molecules 
and/or Polypeptides 

It is understood that related nucleic acid 
30 molecules include allelic or splice variants of the 
nucleic acid molecule of SEQ ID NO:l, SEQ ID NO ; 3 , or 
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SEQ ID NO: 6, and include sequences which are 
complementary to any of the above nucleotide sequences. 
Related nucleic acid molecules also include a 
nucleotide sequence encoding a polypeptide comprising 
5 . or consisting essentially of a substitution, 
modification, addition and/or a deletion of one or more 
amino acid residues compared to the polypeptide in SEQ 
ID NO:2, SEQ ID NO:4, or SEQ ID NO : 7 . 

Fragments include molecules which encode a 
10 polypeptide of at least about 25 amino acid residues, 
or about 50, or about 75, or about 100, or greater than 
about 100 amino acid residues of the polypeptide of SEQ 
ID NO:2, SEQ ID NO:4, or SEQ ID NO : 7 . 

In addition, related C3b/C4b CR-like nucleic acid 

15 molecules include those molecules which comprise 
nucleotide sequences which hybridize under moderately 
or highly stringent conditions as defined herein with 
the fully complementary sequence of the nucleic acid 
molecule of SEQ ID NO:l, SEQ ID NO: 3, or SEQ ID NO: 6, 

20 or of a molecule encoding a polypeptide, which 
polypeptide comprises the amino acid sequence as shown 
in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:7, or of a 
nucleic acid fragment as defined herein, or of a 
nucleic acid fragment encoding a polypeptide as defined 

25 herein. Hybridization probes may be prepared using the 
C3b/C4b CR-like sequences provided herein to screen 
cDNA, genomic or synthetic DNA libraries for related 
sequences. Regions of the DNA and/or amino acid 
sequence of C3b/C4b CR-like polypeptide that exhibit 

30 significant identity to known sequences are readily 
determined using sequence alignment algorithms as 
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'described herein and those regions may be used to 
design probes for screening. 

The term "highly stringent conditions" refers to 
those conditions that are designed to permit 
5 hybridization of DNA strands whose sequences are highly 
complementary, and to exclude hybridization of 
.significantly mismatched DNAs . Hybridization 
stringency is principally determined by temperature, 
ionic strength, and the concentration of denaturing 

10 agents such as formamide. Examples of "highly 
stringent conditions" for hybridization and washing are 
0.015M sodium chloride, 0 . 0015M . sodium citrate at 65- 
68°C or 0.015M sodium chloride, 0.0015M sodium citrate, 
and 50% formamide at 42°C. See Sambrook, Fritsch & 

15 Maniatis, Molecular Cloning: A Laboratory Manual, 2 nd 
Ed. , Cold Spring Harbor Laboratory, (Cold Spring 
Harbor, N. Y. 1989); Anderson et al . , Nucleic Acid 
Hybridisation: a practical approach, Ch. 4, IRL Press 
Limited (Oxford, England) . 

2 0 More stringent conditions (such as higher 

temperature, lower ionic strength, higher formamide, or 
other denaturing agent) may also be used, however, the 
rate of hybridization will be affected. Other agents 
may be included in the hybridization and washing 

25 buffers for the purpose of reducing non-specific and/or 
background hybridization. Examples are 0.1% bovine 
serum albumin, 0.1% polyvinyl -pyrrol idone, 0.1% sodium 
pyrophosphate, 0.1% sodium dodecyl sulfate (NaDodS0 4 or 
SDS) , ficoll, Denhardt's solution, sonicated salmon 

30 sperm DNA (or other non- complementary DNA), and dextran 
sulfate, although other suitable agents can also be 
used. The concentration and types of these additives 
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can be changed without substantially . affecting the 
stringency of the hybridization conditions. 
Hybridization experiments are usually carried out at pH 
'6.8-7.4, however, at typical ionic strength conditions, 
"'5 the rate of hybridization is nearly independent of pH. 
See Anderson et al., Nucleic Acid Hybridisation: a 
Practical Approach, Ch . 4, IRL Press Limited (Oxford, 
England) . 

Factors affecting the stability of a DNA duplex 
10 include base composition, length, and degree of base 
pair mismatch. Hybridization conditions can be 

adjusted by one skilled in the art in order to 
accommodate these variables and allow DNAs of different 
sequence relatedness to form hybrids. The melting 
15 temperature of a perfectly matched DNA duplex can be 
estimated by the following equation: 

T ro (°C) « 81.5 + 16.6 (log [Na+) ) + 0.41 (%G+C) - 600/N - 

0.72 ("%f ormamide) 

where N is the length of the duplex formed, [Na+] 
2 0 is the molar concentration of the* sodium ion in the 
hybridization or washing solution, %G+C is the 
percentage of (guanine+cytosine) bases in the hybrid. 
For imperfectly matched hybrids, the melting 
temperature is reduced by approximately 1°C for each 1% 
25 mismatch. 

The term "moderately stringent conditions" refers 
to conditions under which a DNA duplex with a greater 
degree of base pair mismatching than could occur under 
"highly stringent conditions" is able to form. 
30 Examples of typical "moderately stringent conditions" 
are 0.015M sodium chloride, 0.0015M sodium citrate at 
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50-65°C or 0.015M sodium chloride, 0.6015M sodium 
citrate, and 20% formamide at 37-50°C. By way of 
example, a "moderately stringent" condition of 50°C in 
0.015 M sodium ion will allow about a 21% mismatch. 

5 It will be appreciated by those skilled in the art 

that there is no absolute distinction between "highly" 
and "moderately" stringent conditions. For example, at 
0.015M sodium ion (no formamide), the melting 
temperature of perfectly matched long DNA is about 
10 71°C. With a wash at 65°C (at the same ionic 
strength) , this would allow for approximately a 6% 
mismatch. To capture more distantly related sequences, 
one skilled in the art can simply lower the temperature 
or. raise the ionic strength. 

15 A good estimate of the melting temperature in 1M 

NaCl* for oligonucleotide probes up to about 2 0nt is 
given by: 

Tm = 2°C per A-T base pair + 4°C per G-C base pair 

*The sodium ion concentration in 6X salt sodium 
20 citrate (SSC) is 1M. See Suggs et al . , Developmental 
Biology Using Purified Genes, p. 683, Brown and Fox 
(eds. ) (1981) . 

High stringency washing conditions for 
oligonucleotides are usually at a temperature of 0-5°C 
25 below the Tm of the oligonucleotide in 6X SSC, 0.1% 
SDS. 

In another embodiment, related nucleic acid 
molecules comprise or consist of a nucleotide sequence 
that is about 70 percent identical to the nucleotide 
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sequence as shown in SEQ ID NO:l, SEQ ID NO: 3, or SEQ 
ID NO: 6, or comprise or consist essentially of a 
nucleotide sequence encoding a polypeptide that is 
about 70 percent identical to the polypeptide as set 
5 forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7. In 
preferred embodiments, the nucleotide sequences are 
about 75 percent, or about 80 percent, or about 85 
percent, or about 90 percent, or about 95, 96, 97, 98, 
or 99 percent identical to the nucleotide sequence as 

10 shown in SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6, or 
the nucleotide sequences encode a polypeptide that is 
about 75 percent, or about 80 percent, or about 85 
percent, or about 90 percent, or about 95, 96, 97, 98, 
or 99 percent identical to the polypeptide sequence as 

15 set forth in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO: 7. 

Differences in the nucleic acid sequence may 
result in conservative and/or non- conservative 
modifications of the amino acid sequence relative to 
the amino acid sequence of SEQ ID NO: 2, SEQ ID N0:4, or 
20 SEQ ID NO: 7. 

Conservative modifications to the amino acid 
sequence of SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7 
(and the corresponding modifications to the encoding 
nucleotides) will produce C3b/C4b CR-like polypeptides 

25 having functional and chemical characteristics similar 
to those of naturally occurring C3b/C4b CR-like 
polypeptide. In contrast, substantial modifications in 
the functional and/or chemical characteristics of. 
C3b/C4b CR-like polypeptides may be accomplished by 

30 selecting substitutions in the amino acid sequence of 
SEQ ID NO: 2, SEQ ID NO:4, or SEQ ID* NO : 7 that differ 
significantly in their effect on maintaining (a) the 
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'structure of the molecular backbone in the area of the 
substitution, for example, as a sheet or . helical 
conformation, (b) the charge or hydrophobicity of the 

'molecule at the target site, or (c) the bulk of the 
— 5 side chain. 

For example.. a ^conservative amino acid 
substitution" may involve a substitution of a native 
amino acid residue with a nonnative residue such that 
there is little or no effect on the polarity or charge 
10 of the amino acid residue at that position. 
Furthermore, any native residue in the polypeptide may 
also be substituted with alanine, as has been 
previously described for "alanine scanning 
mutagenesis." 

Conservative amino acid substitutions also 
encompass rion-naturaliy occurring amino acid residues 
which are typically incorporated by chemical peptide 
synthesis rather than by synthesis in biological 
20 systems. These include peptidomimetics , and other 
reversed or inverted forms of amino acid moieties. 

Naturally occurring residues may be divided into 
classes based on common side chain properties: 

1) hydrophobic: norleucine, Met, Ala, Val, Leu, 
25 lie; 

2) neutral hydrophilic: Cys, Ser, Thr, Asn, 
Gin; 

3) acidic: Asp, Glu; 

4) basic: His, Lys, Arg; 

3 0 5) residues that influence chain orientation: 

Gly, Pro; and 
6) aromatic: Trp, Tyr, Phe. 
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For example, non- conservative substitutions may 
involve the exchange of a member of one of these 
classes for a member from another class. Such 
substituted residues may be introduced into regions of 
5 the human C3b/C4b CR-like polypeptide that are 
homologous with non-human C3b/C4b CR-like polypeptide 
orthologs, or into the non- homologous regions of the 
molecule . 

In making such changes, the hydropathic index of 
10 amino acids may be considered. Each amino acid has ' 

been assigned a hydropathic index on the basis of their 

hydrophobicity and charge characteristics, these are: 

isoleucine (+4.5); valine (+4.2); leucine (+3.8); 

phenylalanine (+2.8); cysteine/cystine (+2.5); 

15 methionine (+1.9); alanine (+1.8); glycine (-0.4); 

threonine (-0.7); serine (-0.8); tryptophan (-0.9); 

tyrosine (-1.3); proline (-1.6); histidine (-3.2); 

glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); 

asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

20 The importance of the hydropathic amino acid index 

in conferring interactive biological function on a 
protein is understood in the art. Kyte et al . , J. Mol . 
Biol., 157 : 105-131 (1982). It is. known that certain 
amino acids may be substituted for other amino acids 

25 having a similar hydropathic index or score and still 
retain a similar biological activity. In making 
changes based upon the hydropathic index, the 
substitution of amino acids whose hydropathic indices 
are within ±2 is preferred, those which are within +1 

30 are particularly preferred, and those within ±0.5 are 
even more particularly preferred. 
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~ J It is also understood in the art that the 

substitution of like amino acids can be made 
effectively on the basis of hydrophilicity, 
.particularly where the biologically functionally 

-.5 equivalent protein or peptide thereby created is 
intended for use in immunological embodiments, as in 
the present case. The greatest local average 

hydrophilicity of a protein, as governed by the 
hydrophilicity of its adjacent amino acids, correlates 

10 with its immunogenicity and antigenicity, i.e., with, a 
biological property of the protein. 

The following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); 
lysine (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 

15 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 ± 1); 
alanine (-0.5); histidine (-0.5); cysteine (-1.0); 
methionine .(-1.3); valine (-1.5); leucine (-1.8); 
isoleucine (-1.8); tyrosine (-2.3); phenylalanine (- 

20 2,5); tryptophan (-3.4). In making changes based upon 
similar hydrophilicity values, the substitution of 
amino acids whose hydrophilicity values are within ±2 
is preferred, those which are within ±1 are 
particularly preferred, and those within ±0.5 are even 

25 more particularly preferred. One may also identify 
epitopes from primary amino acid sequences on the basis 
of hydrophilicity. These regions are also referred to 
as "epitopic core regions. " 

Desired amino acid substitutions (whether 
30 conservative or non-conservative) can be determined by 
those skilled in the art at the time such substitutions 
are desired. For example, amino acid substitutions can 



29 



WO 02/10199 



PCT/US01/23232 



t>e used to identify important residues of the C3b/C4b 
CR-like polypeptide, or to increase or decrease the 
affinity of the C3b/C4b CR-like polypeptides described 
'herein. 

Exemplary amino acid substitutions are set forth 
in Table I . 
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Table I 
Amino Acid Substitutions 



Original 
Kesiaues 


Exemplary 

bUDSCltUtlOnS 


Preferred 
Substitutions 


Ala 


Val, Leu, He 


Val 


Arg 


Lys, Gin, Asn 


Lys 


Asn 


Gin 


Gin 


Asp 


Glu 


Glu 


Cys 


Ser, Ala 


Ser 


Gin 


Asn 


Asn 


Glu 


Asp 


Asp 


Gly 


Pro, Ala 


Ala 


His 


Asn, Gin, Lys, Arg 


Arg 


He 


Leu, Val, Met, Ala, 
Phe, Norleucine 


Leu 


Leu 


Norleucine, He, 
Val, Met, Ala, Phe 


He 


Lys 


Arg, 1,4 Diamino- 
butyric Acid, Gin, 
Asn 


Arg 


Met 


Leu, Phe, He 


Leu 


Phe 


Leu, Val, He, Ala, 
Tyr 


Leu 


Pro 


Ala 


Gly 


Ser 


Thr, Ala, Cys 


Thr 


Thr 


Ser 


Ser 


Trp 


Tyr, Phe 


Tyr 


Tyr 


Trp, Phe, Thr, Ser 


Phe 


Val 


He, Met, Leu, Phe, 
Ala, Norleucine 


Leu 
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A skilled artisan will be able to determine 
suitable variants of the polypeptide as set forth in 
SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO : 7 using well 
known techniques. For identifying suitable areas of 
5 the molecule that may be changed without destroying 
activity, one skilled in the art may target areas not 
believed to be important for activity. For example, 
when similar polypeptides with similar activities from 
the same species or from other species are known, one 

10 skilled in the art may compare the amino acid sequence 
of a C3b/C4b CR-like polypeptide to such similar 
polypeptides. With such a comparison, one can identify 
residues and portions of the molecules that are 
conserved among similar polypeptides. It will be 

15 appreciated that changes in areas of a C3b/C4b CR-like 
polypeptide that are not conserved relative to such 
similar polypeptides would be less likely to adversely 
affect the biological activity and/or structure of the 
C3b/C4b CR-like polypeptide. One skilled in the art 

2 0 would also know that, even in relatively conserved 
regions, one may substitute chemically similar amino 
acids for the naturally occurring residues while 
retaining activity (conservative amino acid residue 
substitutions) . Therefore, even areas that may be 

2 5 important for biological activity or for structure may 
be subject to conservative amino acid substitutions 
without destroying the biological activity or without 
adversely affecting the polypeptide structure. 

Additionally, one skilled in the art can review 
30 structure-function studies identifying residues in 
similar polypeptides that are important for activity or 
structure. In view of such a comparison, one can 
predict the importance of amino acid residues in a 
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'C3b/C4b CR-like polypeptide that correspond to amino 
acid residues that are important for activity or 
structure in similar polypeptides. One skilled in the 
'art may opt for chemically similar amino acid 
-5 substitutions for such predicted important amino acid 
residues of C3b/C4b CR-like polypeptides* 

One skilled in the art can also analyze the three- 
dimensional structure and amino acid sequence in 
relation to that structure in similar polypeptides. In 

10 view of that information, one skilled in the art may 
predict the alignment of amino acid residues of a 
C3b/C4b CR-like polypeptide with respect to its three 
dimensional structure. One skilled in the art may 
choose not to make radical changes to amino acid 

15 residues predicted to be on the surface of the protein, 
since such residues may be involved in important 
interactions with other molecules. Moreover, one 
skilled in the art may generate test variants 
containing a single amino acid substitution at each 

20 desired amino acid residue. The variants can then- be 
screened using activity assays know to those skilled in 
the art. Such variants could be used to gather 
information about suitable variants. For example, if 
one discovered that a change to a particular amino acid 

25 residue resulted in destroyed, undesirably reduced, or 
unsuitable activity, variants with such a change would 
be avoided. In other words, based on information 
gathered from such routine experiments, one skilled in 
the art can readily determine the amino acids where 

30 further substitutions should be avoided either alone or 
in combination with other mutations. 
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~~ ' A number of scientific publications have been 

devoted to the prediction of secondary structure. See 
Moult J., Curr. Op. in Biotech., 7 (4) -.422-427 (1996), 
'Chou et al., Biochemistry, 1 3 (2) :222-245 (1974); Chou 

-5 et al., Biochemistry, 113 (2) :211-222 (1974); Chou et 
al., Adv. Enzymol. Relat. Areas Mol. Biol., 47 :45-148 
(1978); Chou et al . , Ann. Rev. Biochem. , 47:251-276 and 
Chou et al., Biophys. J., 2^5:367-384 (1979). Moreover, 
computer programs are currently available to assist 

10 with predicting secondary structure. One method of 
predicting secondary structure is based upon homology 
modeling. For example, two polypeptides or proteins 
which have a sequence identity of greater than 30%, or 
similarity greater than 40% often have similar 

15 structural topologies. The recent growth of the 
protein structural data base (PBB) has provided 
enhanced predictability of secondary structure, 
including the potential number of folds within a 
polypeptide's or protein's structure. See Holm et al., 

20 Nucl. Acid. Res., 27 (1) :244-247 (1999). It has been 
suggested (Brenner et al., Curr. Op. -Struct. Biol., 
1 (3) :369-376 (1997)) that there are a limited number of 
folds in a given polypeptide or protein and that once a 
critical number of structures have been resolved, 

25 structural prediction will gain dramatically in 
accuracy. 

Additional methods of predicting secondary 
structure include * threading" (Jones, D. , Curr. Opin. 
Struct. Biol., 7 (3) :377-87 (1997); Sippl et al . , 
30 Structure, 4 (1) : 15-9 (1996)), "profile analysis" (Bowie 
et al., Science, 253 :164-170 (1991); Gribskov et al., 
Meth. Enzym. , 183 : 146-159 (1990); Gribskov et al., 
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Proc. Nat. Acad. Sci . , 84 (13) :4355-4358 (1987)), and 
"evolutionary linkage" (See Home, supra, and Brenner, 
supra) . 

Preferred C3b/C4b CR-like polypeptide variants 
5 include glycosylation variants wherein the number 
and/or type of glycosylation sites has been altered 
compared to the amino acid sequence set forth in SEQ ID 
NO: 2, SEQ ID NO:4, or SEQ ID NO: 7. In one embodiment, 
C3b/C4b CR-like polypeptide variants comprise a greater 

10 or a lesser number of N-linked glycosylation sites than 
the amino acid sequence set forth in SEQ ID NO: 2, SEQ 
ID NO : 4 , or SEQ ID NO : 7 . An N-linked glycosylation 
site is characterized by the sequence: Asn-X-Ser or 
Asn-X-Thr, wherein the amino acid residue designated as 

15 X" may be any amino acid residue except proline. The 
substitution (s) . of amino acid residues to create this 
sequence provides a potential new site for the addition 
of an N-linked carbohydrate chain. Alternatively, 
substitutions which eliminate this sequence will remove 

2 0 an existing N-linked carbohydrate chain. Also provided 

is a rearrangement of N-linked carbohydrate chains 
wherein one or more N-linked glycosylation sites 
(typically those that are naturally occurring) are 
eliminated and one or more new N-linked sites are 
25 created. Additional preferred C3b/C4b CR-like variants 
include cysteine variants, wherein one or more cysteine 
residues are deleted from or substituted for another 
amino acid (e.g., serine) as compared to the amino 
acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 4, or 

3 0 SEQ ID NO: 7. Cysteine variants are useful when C3b/C4b 

CR-like polypeptides must be refolded into a 
biologically active conformation such as after the 
isolation of insoluble inclusion bodies. Cysteine 
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Variants generally have fewer cysteine residues than 
the native protein, and typically have an even number 
to minimize interactions resulting from unpaired 

'cysteines. 

5 In addition, the polypeptide comprising the amino 

acid sequence of SEQ ID NO: 2, SEQ ID N0:4, or SEQ ID 
NOzl or a C3b/C4b CR-like polypeptide variant may be 
fused to a homologous polypeptide to form a homodimer 
or to a heterologous polypeptide to form a heterodimer. 
.10 Heterologous peptides and polypeptides include, but are 
not limited to: an epitope to allow for the detection 
and/or isolation of a C3b/C4b CR-like fusion 
polypeptide; a transmembrane receptor protein or a 
portion thereof, such as an extracellular domain, or a 

15 transmembrane and intracellular domain; a ligand or a 
portion thereof which binds to a transmembrane receptor 
protein; an enzyme or portion thereof which is 
catalytically active; a polypeptide or peptide which 
promotes oligomerization, such as a leucine zipper 

20 domain; a polypeptide or peptide which increases 
stability, such as an immunoglobulin constant region; 
and a polypeptide which has a therapeutic activity 
different from the polypeptide comprising the amino 
acid sequence as set forth in SEQ ID NO: 2, SEQ ID NO: 4 , 

25 or SEQ ID .NO: 7, or a C3b/C4b CR-like polypeptide 
variant . 

Fusions can be made either at the. amino terminus 
or at the carboxy terminus of the polypeptide 
comprising the amino acid sequence set forth in SEQ ID 
30 NO: 2, SEQ ID NO:4, or SEQ ID NO: 7, or a C3b/C4b CR-like 
polypeptide variant. Fusions may be direct with no 
linker or adapter molecule or indirect using a linker 
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or adapter molecule. A linker or adapter molecule may 
be one or more amino acid residues, typically up to 
about 20 to about 50 amino acid residues. A linker or 
adapter molecule may also be designed, with a cleavage 
5 site for a DNA restriction endonuclease or for a 
protease to allow for the separation of the fused 
moieties. It will be appreciated that once 

constructed, the fusion polypeptides can be derivatized 
according to the methods described herein. 

1 0 In a further embodiment of the invent ion , the 

polypeptide comprising the amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:4, or' SEQ ID NO:7, or a C3b/C4b CR- 
like polypeptide variant is fused to one or more 
domains of an Fc region of human IgG. Antibodies 

15 ' comprise two functionally independent parts, a variable 
domain known as "Fab", which binds antigen, and a 
constant domain known as n Fc" , which is involved in 
effector functions such as complement activation and 
attack by phagocytic cells. An Fc has a long serum 

2 0 half -life, whereas an Fab is short-lived. Capon et 
al., Nature, 337 :525-31 (1989). When constructed 
together with a therapeutic protein, an Fc domain can 
provide longer half-life or incorporate such functions 
as Fc receptor binding, protein A binding, complement 

25 fixation and perhaps even placental transfer. Id. 
Table II summarizes the use of certain Fc fusions known 
in the art . 
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TABLE II: FC FUSION WITH THERAPEUTIC PROTEINS 



Form of Fc 


Fusion 


Therapeutic 






partner 


implications 


Reference 


IgGl 


N- terminus 


Hodgkin's 


U.S. Patent No. 




of CD30-L 


disease; 
anaplastic 
lymphoma ; T-cell 
leukemia 


5,480, 981 


Murine 
Fey 2 a 


IL-10 


anti- 
inflammatory; 
transplant 
rejection 


Zheng et al . 
(1995) , J*. 
Immunol . , 154 : 
5590-5600 


IgGl 


TNF 
receptor 


septic shock 

• 


Fisher et al. 
(1996) , N. Engl. 
J. Med., 334: 
1697-1702; Van 
Zee et al . , 
(1996) , J. 
Immunol . , 156 : 
2221-2230 


IgG, IgA, 


TNF 


inflammation, 


U.S. Pat. No. • 


IgM, or 
IgE 
(excluding 
the • first 


receptor 


autoimmune 
disorders 


5,808,029, issuec 
September 15, '•: 
1998 


domain) 








IgGl 


CD4 
receptor 


AIDS 


Capon et al . 
(1989) , Nature 
337: 525-531 


IgGl, 


N-terminus 


anti-cancer, 


Harvill et al. 


IgG3 


of IL-2 


antiviral 


(1995) , 

Immunotech. , 1 : 
95-105 


IgGl 


C- terminus 
of OPG 


osteoarthritis ; 
bone density 


WO 97/23614 , 
published July 3, 
1997 


IgGl 


N- terminus 
of leptin 


ant i -obesity 


PCT/US 97/23183, 
filed December 
11, 1997 


Human Ig 
Cyl 


CTLA-4 


autoimmune 
disorders 


Linsley (1991) , 
J". Exp . Med. , 






174:561-569 



In one example, all or a portion of the human IgG 
hinge, CH2 and CH3 regions may be fused at either the 
5 N-terminus or C- terminus of the C3b/C4b CR-like 
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polypeptides using methods known to the skilled 
artisan. The resulting C3b/C4b CR-like fusion 

polypeptide may be purified by use of a Protein A 
affinity column. Peptides and proteins fused to an Fc 
.,..5 ( region have been found' to exhibit a substantially 
greater half -life in vivo than the unfused counterpart. 
Also, a fusion to an Fc region allows for 
dimerization/multimerization of the fusion polypeptide. 
The Fc region may be a naturally uccurx*ing Fc region, 
10 or may be altered to improve certain qualities, such as 

, therapeutic qualities, circulation time, reduce 
aggregation, etc. 

Identity and similarity of related nucleic acid 
molecules and polypeptides can be readily calculated by 

15 known methods. Such methods include, but are not 
limited to, those described in Computational Molecular 
Biology, Lesk, A.M., ed. , Oxford University Press, New 
York, 1988; Biocomputing : Informatics and Genome 
Projects, Smith, D.W., ed., Academic Press, New York, 

20 1993;' Computer Analysis of Sequence Data, Part 1, 
Griffin, A.M., and Griffin, H.G. , eds . , Humana Press, 
New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; Sequence 
Analysis Primer, Gribskov, M. and Devereux, J., eds., 

25 M. Stockton Press, New York, 1991; and Carillo et al . , 
SIAM J". Applied Math., 48:1073 (1988) . 

Preferred methods to determine identity and/or 
similarity are designed to give the largest match 
between the sequences tested. Methods to determine 

30 identity and similarity are described in publicly 
available computer programs. Preferred computer 

program methods to determine identity and similarity 
between two sequences include, but are not limited to, 
the GCG program package, including GAP (Devereux et 

35 al . , Nucl. Acid. -Res., 12:387 (1984) ; Genetics Computer 
Group, University of Wisconsin, Madison, WI) , BLASTP, 
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* BLASTN, and FASTA (Altschul et al . , J. Mol . Biol., 
215:403-410 (1990)), The BLASTX program is publicly 
available from the National Center for Biotechnology 
'Information (NCBI) and other sources {BLAST Manual, 
"5 Altschul et al. NCB/NLM/NIH Bethesda, MD 20894; 
Altschul et al., supra). The well known Smith Waterman 
algorithm may also be used to determine identity. 

Certain alignment schemes for aligning two amino 
acid sequences may result in the matching of only a 

10 short region of the two sequences, and this small 
aligned region may have very high sequence identity 
even though there is no significant relationship 
between the two full length sequences. Accordingly, in 
a preferred embodiment, the selected alignment method 

15- (GAP program) will result in an alignment: that spans at 
least 50 contiguous amino acids of the target 
polypeptide . 

For example, using the computer algorithm GAP 
(Genetics Computer Group, University of Wisconsin, 

20 Madison, WI) , two polypeptides for which the percent 
sequence identity is to be determined are aligned for 
optimal matching of their respective amino acids (the 
"matched span" , as determined by the algorithm) . A gap 
opening penalty (which is calculated as 3X the average 

25 diagonal; the "average diagonal" is the average of the 
diagonal of the comparison matrix being used; the 
"diagonal" is the score or number assigned, to each 
perfect amino acid match by the particular comparison 
matrix) and a gap extension penalty (which is usually 

3 0 1/10 times the gap opening penalty) , as well as a 
comparison matrix such as PAM 250 or BLOSUM 62 are used 
in conjunction with the algorithm. A standard 
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comparison matrix (see Dayhoff et al . , Atlas of Protein 
Sequence and Structure, vol. 5, supp.3 (1978) for the 
PAM 250 comparison matrix; Henikoff et al . , Proc. Natl. 
Acad. Sci USA, 89:10915-10919 (1992) for the BLOSUM 62 
5 comparison matrix) is also used by the algoxithm. 

Preferred parameters for a polypeptide sequence 
comparison include the following: 

Algorithm: . Needleman et al . , J". Mol . Biol . , 
48:443-453 (1970); 

Comparison matrix: BLOSUM 62 from Henikoff et 
al., Proc. Natl. Acad. Sci. USA, 89:10915-10919 
(1992); 

Gap Penalty: 12 
Gap Length Penalty: 4 
Threshold of Similarity: 0 

The GAP program " is useful with the above 
parameters. The aforementioned parameters are the 
default parameters for polypeptide comparisons (along 
20 with no penalty for end gaps) using the GAP algorithm. 

Preferred parameters for nucleic acid molecule 
sequence comparisons include the following: 

Algorithm: Needleman et al., J*. Mol Biol., 48:443- 
453 (1970) ; 

25 Comparison matrix: matches = +10, mismatch = 0 

Gap Penalty: 50 
Gap Length Penalty: 3 

The GAP program is also useful with the above 
3 0 parameters. The aforementioned parameters are the 



10 



15. 
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default parameters for nucleic acid molecule 
comparisons . 

Other exemplary algorithms, gap opening penalties, 
gap extension penalties, comparison matrices, 
5 thresholds of similarity, etc . may be used, including 
those set forth in the Program Manual, Wisconsin 
Package, Version 9, September, 1997- The particular 
choices to be made will be apparent to those of skill 
in the art and will depend on the specific comparison 

10 to be made, such as DNA to DNA, protein to protein, 
protein to DNA; and additionally, whether the 
comparison is between given pairs of sequences (in 
which case GAP or BestF.it are generally preferred) or 
between one sequence and a large database of sequences 

15 (in which case FASTA or BLASTA are preferred) . 

Synthesis 

It will be appreciated by those skilled in the art 
the nucleic acid and polypeptide molecules described 
20 herein may be produced by recombinant and other means. 

Nucleic Acid Molecules 

The nucleic acid molecules encode a polypeptide 
comprising the amino acid sequence of a C3b/C4b CR-like 
25 polypeptide can readily be obtained in a variety of 
ways including, without limitation, chemical synthesis, 
cDNA or genomic library screening, expression library 
screening and/or PCR amplification of cDNA. 

Recombinant DNA methods used herein are generally 
30 those set forth in Sambrook et al . , Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory 
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J Press, Cold Spring Harbor, NY (1989), and/or Ausubel et 
al . , eds. # Current Protocols in Molecular Biology, 
Green Publishers Inc. and Wiley and Sons, NY (1994) . 
'The present invention provides for nucleic acid 
5 molecules as described herein and methods for obtaining 
the molecules. 

Where a gene encoding the amino acid sequence of a 
C3b/C4b CR-like polypeptide has been identified from 
one species, all or a portion of that gene may be used 

10 as a probe to identify orthologs or related genes from 
the same species. The probes or primers may be used to 
screen cDNA libraries from various tissue sources 
believed to express the C3b/C4b CR-like polypeptide. 
In addition, part or all of a nucleic acid molecule 

15 having the sequence as set forth in SEQ ID NO:l, SEQ ID 
NO: 3, or SEQ ID NO: 6 may be used to screen a genomic 
library to identify and isolate a gene encoding the 
amino acid sequence of a C3b/C4b CR-like polypeptide. 
Typically, conditions of moderate or high stringency 

20 will be employed for screening to minimize the number 
of false positives obtained from the screen.. 

Nucleic acid molecules encoding the amino acid 
sequence of C3b/C4b CR-like polypeptides may also be 
identified by expression cloning which employs the 

25 detection of positive clones based upon a property of 
the expressed protein. Typically, nucleic acid 

libraries are screened by the binding of an antibody or 
other binding partner (e.g., receptor or ligand) to 
cloned proteins which are expressed and displayed on a 

30 host cell surface. The antibody or binding partner is 
modified with a detectable label to identify those 
cells expressing the desired clone. 
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Recombinant expression techniques conducted in 
accordance with the descriptions set forth below may be 
followed to produce these polynucleotides and to 
express the encoded polypeptides. For example, by 
5 inserting a nucleic acid sequence which encodes the 
amino acid sequence of a C3b/C4b CR-like polypeptide 
into an appropriate vector, one skilled in the art can 
readily produce large quantities of the desired 
nucleotide sequence. The sequences can then be used to 

10 generate detection probes or amplification primers. 
Alternatively, a polynucleotide encoding the amino acid 
sequence of a C3b/C4b CR-like polypeptide can be 
inserted into an expression vector. By introducing the 
expression vector into an appropriate host, the encoded 

15 C3b/C4b CR-like polypeptide may be produced in large 
amounts . 

Another method for obtaining a suitable nucleic 
acid' sequence is the polymerase chain reaction (PCR) . 
In this method, cDNA is prepared from poly (A) +RNA or 

20 total RNA using the enzyme reverse transcriptase. Two 
primers, typically complementary to two separate 
regions of cDNA (oligonucleotides) encoding the amino 
acid sequence of a C3b/C4b CR-like polypeptide, are 
then added to the cDNA along with a polymerase such as 

25 Tag polymerase, and the polymerase amplifies the cDNA 
region between the two primers. 

Another means of preparing a nucleic acid molecule 
encoding the amino acid sequence of a C3b/C4b CR-like 
polypeptide is chemical synthesis using methods well 
3 0 known to the skilled artisan such as those described by 
Engels et al . , Angew. Chem. Intl. Ed., 28:716-734 
(1989) . These methods include, inter alia, the 
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'phosphotriester, phosphoramidite, and H-phosphonate 
methods for nucleic acid synthesis. A preferred method 
for such chemical synthesis is polymer- supported 
'synthesis using standard phosphoramidite chemistry, 

—5 Typically, the DNA encoding the amino acid sequence of 
a C3b/C4b CR-like polypeptide will be several hundred 
nucleotides in length. Nucleic acids larger than about 
^100 nucleotides can be synthesized as several fragments 
using these methods. The fragments can then be ligated 

10 together to form the full length nucleotide sequence of 
a C3b/C4b CR-like polypeptide. Usually , the DNA 
fragment encoding the amino terminus of the polypeptide 
will have an ATG, which . encodes a methionine residue. 
This methionine may or may not be present on the mature 
.15 form of the C3b/C4b CR-like polypeptide, depending on 
whether the polypeptide produced in the host cell is 
designed to be secreted from that cell. Other methods 
known to the skilled artisan may be used as well. 

In certain embodiments, nucleic acid variants 

2 0 contain codons which have been altered for the optimal 
expression of a C3b/C4b CR-like polypeptide in a given 
host cell. Particular codon alterations will depend 
upon the C3b/C4b CR-like polypeptide (s) and host 
cell (s) selected for expression. Such "codon 

25 optimization" can be carried out by a variety of 
methods, for example, by selecting codons which are 
preferred for use in highly expressed genes in a given 
host cell. Computer algorithms which incorporate codon 
frequency tables such as "Ecohigh. cod" for codon 

30 preference of highly expressed bacterial genes may be 
used and are provided by the University of Wisconsin 
Package Version 9.0, Genetics Computer Group, Madison, 
WI . Other useful codon frequency tables include 
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' "Celegans_high. cod" , "Celegans_low . cod" , 

"Drosophila_high.cod" , "Humanjiigh. cod" , 

"Mai ze_high . cod" , and "Yeas t_high . cod" . 

*■ »• * i 

Vectors and Host Cells 

5 A nucleic acid molecule encoding the amino acid 

sequence of a C3b/C4b CR-like polypeptide may be 
inserted into an appropriate expression vector using 
standard ligation techniques. The vector is typically 
selected to be functional in the particular host cell 

10 employed (i.e., the vector is compatible with the host 
cell machinery such that' amplification of the gene 
and/or expression of the gene can occur) . A nucleic 
acid molecule encoding the amino acid sequence of a 
C3b/C4b CR-like polypeptide may be amplified/expressed 

15 in prokaryotic, yeast, insect (baculovirus systems) , 
and/or eukaryotic host cells. Selection of the host 
cell will depend in part on whether a C3b/C4b CR-like 
polypeptide is to be post-translationally modified 
(e.g., glycosylated and/or phosphorylated) . If so, 

20 yeast, insect, or mammalian host cells are preferable. 
For a review of expression vectors, see Math. Enz., 
v.18 5, D.V. Goeddel, ed. Academic Press Inc., San 
Diego, CA (1990) . 

Typically, expression vectors used in any of the 
25 host cells will contain sequences for plasmid 
maintenance and for cloning and expression of exogenous 
nucleotide sequences. Such sequences, collectively 
referred to as "flanking sequences" in certain 
embodiments will typically include one or more of the 
30 following nucleotide sequences: a promoter, one or more 
enhancer sequences, an origin of replication, a 
transcriptional termination sequence, a complete intron 
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sequence containing a donor and acceptor splice site, a 
sequence encoding a leader sequence for polypeptide 
secretion, a ribosome binding site, a polyadenylation 
sequence, a polylinker region for inserting the nucleic 
5 acid encoding the polypeptide to be expressed, and a 
selectable marker element. Each of these sequences is 
discussed below. 

Optionally, the vector may contain a 11 tag 11 - 
encoding sequence, i.e., an oligonucleotide molecule 

10 located at the 5 1 or 3 end of the C3b/C4b CR-like 
polypeptide coding sequence; the oligonucleotide 
sequence encodes polyHis (such as hexaHis) , or other 
"tag" such as FLAG, HA (hemaglutinin Influenza virus) 
or* myc for which commercially available antibodies 

15 exist. This tag is typically fused to the polypeptide 
upon expression of the polypeptide, and can serve as a 
means for affinity purification of the C3b/C4b CR-like 
polypeptide from the host cell. Affinity purification 
can be accomplished, for example, by column 

20 chromatography using antibodies against the tag as an 
affinity matrix. Optionally, the tag can subsequently 
be removed from the purified C3b/C4b CR-like 
polypeptide by various means such as using certain 
peptidases for cleavage. 

25 Flanking sequences may be homologous, (i.e., from 

the same species and/or strain as the host cell) , 
heterologous (i.e., from a species other than the host 
cell species or strain) , hybrid (i.e., a combination of 
flanking sequences from more than one source) or 

30 synthetic, or the flanking sequences may be native 
sequences which normally function to regulate C3b/C4b 
CR-like polypeptide expression. As such, the source of 
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a flanking sequence may be any prokaryotic or 
eukaryotic organism, any vertebrate or invertebrate 
organism, or any plant, provided that the flanking 
sequence is functional in, and can be activated by, the 
5 host cell machinery. 

The flanking sequences useful in the vectors of 
this invention may be obtained by any of several 
methods well known in the art. Typically, flanking 
sequences useful herein other than the C3b/C4b CR-like 

10 gene, flanking sequences will have been . previously 
identified by mapping and/or by restriction 
endonuclease digestion and can thus be isolated from 
the proper tissue source using the appropriate 
restriction endonucl eases . In some cases, the full 

15 nucleotide sequence of a flanking sequence may be 
known. Here, the flanking sequence .may be synthesized 
using the methods described herein for nucleic acid 
synthesis or cloning. 

Where all or only a portion of the flanking 
20 sequence is known, it may be obtained using PCR and/or 
by screening a genomic library with suitable 
oligonucleotide and/or flanking sequence fragments from 
the same or another species. Where the flanking 
sequence is not known, a fragment of DNA containing a 
25 flanking sequence may be isolated from a larger piece 
of DNA that may contain, for example, a coding sequence 
or even another gene or genes. Isolation may be 
accomplished by restriction endonuclease digestion to 
produce the proper DNA fragment followed by isolation 
30 using agarose gel purification, Qiagen® column 
chromatography (Chatsworth, CA) , or other methods known 
to the skilled artisan. The selection of suitable 
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5 enzymes to accomplish this purpose will be readily 
apparent to one of ordinary skill in the art. 

An origin of replication is typically a part of 
. those prokaryotic expression vectors purchased 
-5 commercially, and the origin aids in the amplification 
of the vector in a host cell. Amplification of the 
vector to a certain copy number can, in some cases, be 
* important for the optimal expression of a C3b/C4b CR- 
like polypeptide. If the vector of choice does not 
10 contain an origin of replication site, one may be 
chemically synthesized based on a known sequence, and 
ligated into the vector. For example, the origin of 
replication from the plasmid pBR322 (Product No. 303- 
. 3s, New England Biolabs, Beverly, MA) is suitable for 
15 most Gram-negative bacteria and various origins (e.g., 
SV40, polyoma, adenovirus, vesicular stomatitus virus 
(VSV) or papillomaviruses such, as HPV or BPV) are 
useful for cloning vectors in mammalian cells. 
Generally, the origin of replication component is not 
20 needed for mammalian expression vectors (for example, 
the SV4 0 origin is often used only because it contains 
the early promoter) . 

A transcription termination sequence is typically 
iocated 3 1 of the end of a polypeptide coding region 

25 and serves to terminate transcription. Usually, a 
transcription termination sequence in prokaryotic cells 
is a G-C rich fragment followed by a poly T sequence. 
While the sequence is easily cloned from a library or 
even purchased commercially as part of a vector, it can 

30 also, be readily synthesized using methods for nucleic 
acid synthesis such as those described herein. 

A selectable marker gene element encodes a protein 
necessary for the survival and growth of a host cell 
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'grown in a selective culture medium. Typical selection 
marker genes encode proteins that (a) confer resistance 
to antibiotics or other toxins, e.g., ampicillin, 
'tetracycline, or kanamycin for prokaryotic host cells, 

"5 (b) complement auxotrophic deficiencies of the cell; or 
(c) supply critical nutrients not available from 
complex media. Preferred selectable markers are the 
kanamycin resistance gene, the ampicillin resistance 
gene, and the tetracycline resistance gene. A neomycin 

10 resistance gene may also be used for selection in 
prokaryotic and eukaryotic host cells. 

Other selection genes may be used to amplify the 
gene which will be expressed. Amplification is the 
process wherein genes which are in greater demand for 

15 : the production of a protein critical for growth are 
reiterated in tandem within the chromosomes of 
successive generations of recombinant cells. Examples 
of suitable selectable markers for mammalian cells 
include dihydrof olate reductase (DHFR) and thymidine 

20 kinase. The mammalian cell transf ormants are placed 
under selection pressure which only the transf ormants 
are uniquely adapted to survive by virtue of the 
selection gene present in the vector. Selection 
pressure is imposed by culturing the transformed cells 

25 under conditions in which the concentration of 
selection agent in the medium is successively changed, 
thereby leading to the amplification of both the 
selection gene and the DNA that encodes a C3b/C4b CR- 
like polypeptide. As a result, increased quantities of 

3C C3b/C4b CR-like polypeptide are synthesized from the 
amplified DNA. 

A ribosome binding site is usually necessary for 
translation initiation of mRNA and is characterized by 
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a Shine -Dalgarno sequence (prokaryotes) or a Kozak 
sequence (eukaryotes) . The element is typically 
located 3 1 to the promoter and 5 1 to the coding 
sequence of a C3b/C4b CR-like polypeptide to be 
5 expressed. The. Shine-Dalgarno sequence is varied but 
is typically a polypurine (i.e., having a high A-G 
content) . Many Shine-Dalgarno sequences have been 
identified, each of which can be readily synthesized 
using methods set forth herein and \ased in a 

10 prokaryotic vector. 

A leader, or signal, sequence may be used to 
direct a C3b/C4b CR-like polypeptide out of the host 
cell.. Typically, a nucleotide sequence encoding the 
signal sequence is positioned in the coding region of a 

15 C3b/C4b CR-like nucleic acid molecule, or directly at 
the 5 1 end of a C3b/C4b CR-like polypeptide coding 
region. Many signal sequences have been identified, 
and any of those that are functional in the selected 
host cell may be used in. conjunction with a C3b/C4b CR- 

2 0 like nucleic acid molecule. Therefore, a signal 

sequence may be homologous (naturally occurring) or 
heterologous to a C3b/C4b CR-like gene or cDNA. 
Additionally, a signal sequence may be chemically 
synthesized using methods described herein. In most 
25 cases , the secretion of a C3b/C4b CR-like polypeptide 
from the host cell via the presence of a signal peptide 
will result in the removal of the signal peptide from 
the secreted C3b/C4b CR-like polypeptide. The signal 
sequence may be a component of the vector, or it may be 

3 0 a part of a C3b/C4b CR-like nucleic acid molecule that 

is inserted into the vector. 

Included within the scope of this invention is the 
use of either a nucleotide sequence encoding a native 
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C3b/C4b CR-like polypeptide signal sequence joined to a 
C3b/C4b CR-like polypeptide coding region or a 
nucleotide sequence encoding a heterologous signal 
'sequence joined to a C3b/C4b CR-like polypeptide coding 

"5 region- The heterologous signal sequence selected 
should be one that is recognized and processed, i.e., 
cleaved by a signal peptidase, by the host cell. For 
prokaryotic host cells that do not recognize and 
process the native C3B/C4B CR-like polypeptide signal 

10 sequence, the signal sequence is substituted by a 
prokaryotic signal sequence selected, for example, from 
the group of the alkaline phosphatase, penicillinase, 
or heat-stable enterotoxin II leaders. For yeast 
secretion, the native C3B/C4B CR-like polypeptide 

15 signal sequence may be substituted by the yeast 
invertase, alpha factor, or acid phosphatase leaders. 
In mammalian cell expression the native signal sequence 
is satisfactory, although other mammalian signal 
sequences may be suitable. 

2 0 In some cases, such as where glycosylation is 

desired in a eukaryotic host cell expression system, 
one may manipulate the various presequences to improve 
glycosylation or yield. For example, one may alter the 
peptidase cleavage site of a particular signal peptide, 
25 or add presequences, which also may affect 
glycosylation. The final protein product may have, in 
the -1 position (relative to the first amino acid of 
the mature protein) one or more additional amino acids 
incident to expression, which may not have been totally 

3 0 removed. For example, the final protein product may 

have one or two amino acid residues found in the 
peptidase cleavage site, attached to the N-terminus . 
Alternatively, use of some enzyme cleavage sites may 
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result in a slightly truncated form of the desired 
C3b/C4b CR-like polypeptide, if the enzyme cuts at such 
area within the mature polypeptide. 

In many cases, transcription of a nucleic acid 
5 molecule is increased by the presence of one or more 
introns in the vector; this is particularly true where 
a polypeptide is produced in eukaryotic. host cells, 
especially mammalian host cells. The introns used may 
be naturally occurring within the C3b/C4b CR-like gene, 

.10 especially where the gene used is a full length genomic 
sequence or a fragment thereof. Where the intron is 
not naturally occurring within the gene (as for most 
cDNAs) , the intron (s) may be obtained from another 
source. The position of the intron with respect to 

15 flanking sequences and the C3b/C4b CR-like gene is 
generally- important, as the intron must be transcribed 
to be effective. Thus, when a C3b/C4b CR-like cDNA 
molecule is being transcribed, the preferred position 
for the intron .is .3' to the transcription start site, 

20 and 5' to the polyA transcription termination sequence. 
Preferably, the intron or introns will be located on 
one side or the other (i.e., 5' or 3') of the cDNA such 
that it does not interrupt the coding sequence. Any 
intron from any source, including any viral, 

25 prokaryotic and eukaryotic (plant or animal) organisms, 
may be used .to practice this invention, provided that 
it is compatible with the host cell (s) into which it is 
inserted. Also included herein are synthetic introns. 
Optionally, more than one intron may be used in the 

3 0 vector. 

The expression and cloning vectors of the present 
invention will each typically contain a promoter that 
is recognized by the host organism and operably linked 
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to the molecule encoding a C3B/C4B CR-like polypeptide. 
Promoters are untranscribed sequences located upstream 
(5 1 ) to the start codon of a structural gene (generally 
within about 100 to 1000 bp) that control the 
5 transcription of the structural gene* Promoters are 
conventionally grouped into one of two classes, 
inducible promoters and constitutive promoters. 
Inducible promoters initiate increased levels of 
transcription from DNA under their control in response 

10 to some change in culture conditions, such as the 
presence or absence of a nutrient or a change in 
temperature. Constitutive promoters, on the other 
hand, initiate continual gene product production; that 
is, there is little or no control over gene expression. 

.15 A large number of promoters, recognized, by a variety of 
potential host cells, are well known. A suitable 
promoter is operably linked to the DNA encoding a 
C3B/C4B CR-like polypeptide by removing the promoter 
from- the source DNA by restriction enzyme digestion and 

2 0 inserting the desired promoter sequence into the 
vector. The native C3B/C4B CR-like gene promoter 
sequence may be used to direct amplification and/or 
expression of a C3B/C4B CR-like nucleic acid molecule. 
A heterologous promoter is preferred, however, if it 

25. permits greater transcription and higher yields of the 
expressed protein as compared to the native promoter, 
and if it is compatible with the host cell system that 
has been selected for use. 

Promoters suitable for use with prokaryotic hosts 
30 include the beta- lactamase and lactose promoter 
systems; alkaline phosphatase, a tryptophan (trp) 
promoter system; and hybrid promoters such as the tac 
promoter. Other known bacterial promoters are also 
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suitable. Their sequences have been published, thereby 
enabling one skilled in the art to ligate them to the 
desired DNA sequence (s) , using linkers or adapters as 
'needed to supply any useful restriction sites. 

5 Suitable promoters for use with yeast hosts are 

advantageously used with yeast promoters. Suitable 
promoters for use with mammalian host cells are well 
known and include, but are not limited to, those 

10 obtained from the genomes of viruses such as polyoma 
virus, fowlpox virus, adenovirus (such as 
Adenovirus 2) , bovine papilloma virus, avian sarcoma 
virus, cytomegalovirus (CMV) , a retrovirus, hepatitis-B 
virus and most preferably Simian Virus 4 0 (SV4 0) . 

15 Other suitable mammalian promoters include heterologous 
mammalian promoters, e.g., heat-shock promoters and the 
act in promoter. 

Additional promoters which may be of interest in 
controlling C3B/C4B CR-like gene transcription include, 

20 but are not limited to: the SV40 early promoter region 
(Bernoist and Chambon, Nature, 290 ; 304-310 , 1981) ; the 
CMV promoter; the promoter contained in the 3 1 long 
terminal repeat of Rous sarcoma virus (Yamamoto et al . , 
Cell, 22:787-797, 1980); the herpes thymidine kinase 

25 promoter (Wagner et al., Proc. Natl, Acad. Sci. USA, 
^78:144-1445, 1981); the regulatory sequences of the 
metallothionine gene (Brinster et al., Nature, 296 :39- 
42, 1982) ; prokaryotic expression vectors such as the 
beta-lactamase promoter (Villa- Kamaroff, et al . , Proc. 

30 Natl. Acad. Sci. USA, 75:3727-3731, 1978); or the tac 
promoter (DeBoer, et al., Proc. Natl. Acad. Sci. USA, 
80:21-25, 1983) . Also of interest are the following 
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' animal transcriptional control regions, which exhibit 
tissue specificity and have been utilized in transgenic 
animals: the elastase I gene control region which is 
'active in pancreatic acinar cells (Swift et al., Cell, 

""5 38^:639-646, 1984; Ornitz et al . , Cold Spring Harbor 
Symp. Quant. Biol., 50:399-409 (1986); MacDonald, 
Hepatology, _7 :425 ~ 515 ' 1987) ; the insulin gene control 
region which is active in pancreatic beta cells 
(Hanahan, Nature, 315:115-122, . 1985); the 

10 immunoglobulin gene control region which is active in 
lymphoid cells (Grosschedl et. al., Cell, 38 : 647-658 
(1984); Adames et al . , Nature, 318:533-538 (1985); 
Alexander et al • , Mol . Cell. Biol., 7:1436-1444, 1987); 
the mouse mammary tumor virus control region which is 

15 active in testicular, breast, lymphoid and mast cells 
(Leder et al . , . Cell, £5:485-495, 1986); the albumin 
gene control region which is active in liver (Pinkert 
et al-, Genes and Devel . , 3 1 :268-276, .1987); the 
alphaf etcprotein gene control region which is active in 

20 liver (Krumlauf et al., Mol. Cell. Biol., 5:1639-1648, 
1985; Hammer et al . , Science, 235^:53-58, 1987); the 
alpha 1- antitrypsin gene control region which is active 
in the liver (Kelsey et al., Genes and Devel., JL:161- 
171, 1987) ; the beta-globin gene control region which 

25 is active in myeloid cells (Mogram et al., Nature, 
315 :338-340,' 1985; Kollias et al . , Cell, 46:89-94, 
1986) ; the myelin basic protein gene control region 
which is active in oligodendrocyte cells in the brain 
(Readhead et al . , Cell, 48:703-712, 1987); the myosin 

3 0 light chain-2 gene control region which is active in 
skeletal muscle (Sani, Nature, 314:283-286, 1985); and 
the gonadotropic releasing hormone gene control region 
which is active in the hypothalamus (Mason et al . , 
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Science, 234:1372-1378, 1986). 

Ai^ enhancer sequence may be inserted into the 
vector to increase the transcription of a DNA encoding 
a C3B/C4B CR- like polypeptide of the present invention 
5 by higher eukaryotes. Enhancers are cis-acting 

elements of DNA, usually about 10-300 bp in length, 
that act on the promoter to increase transcription. 
Enhancers are relatively orientation and position 
independent. They have been found 5' and 3 1 to the 

10 transcription unit. Several enhancer sequences 

available from mammalian genes are known (e.g., globin, 
elastase, albumin, alpha-f eto-protein and insulin). 
Typically, however, an enhancer from a virus will be 
used. The SV40 enhancer, the cytomegalovirus early 

15 promoter enhancer, the polyoma enhancer, and adenovirus 
enhancers are exemplary enhancing elements for the 
activation of eukaryotic promoters. While an enhancer 
may be spliced into the vector at a position 5' or 3' 
to a C3B/C4B CR-like nucleic acid molecule, it is 

20 typically located at a site 5' from the promoter. 

Expression vectors of the invention may be 
constructed from a starting vector such as a 
commercially available vector. Such vectors may or may 
not contain all of the desired flanking sequences. 

2 5 Where one or more of the desired flanking sequences are 

not already present in the vector, they may be 
individually obtained and ligated into the vector. 
Methods used for obtaining each of the flanking 
sequences are well known to one skilled in the art. 

3 0- Preferred vectors for practicing this -invention 

are those which are compatible with bacterial, insect, 
and mammalian host cells. Such vectors include, inter 
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alia, pCRII, pCR3 , and pcDNA3 . 1 (Invitrogen Company, 
Carlsbad, CA) , pBSII (Stratagene Company, La Jolla, 
CA) , pET15D (Novagen, Madison, WI) , pGEX (Pharmacia 
Biotech, Piscataway, NJ) , pEGFP-N2 (Clontech, Palo 
5 Alto, CA) , pETL (BlueBacII; Invitrogen), pDSR-alpha 

(PCT Publication No. WO90/14363) and pFastBacDual 

(Gibco/BRL , Grand Island, NY) . 

Additional suitable vectors include, but are not 

limited to, cosmids, plasmids or modified viruses, but 

10 it will be appreciated that the vector system must be 

compatible with the selected host cell. Such vectors 

include, but are not limited to .plasmids such as 
® 

Bluescript plasmid derivatives (a high copy number 
ColEl-based phagemid, Stratagene Cloning Systems Inc., 
15 La Jolla CA) , PCR cloning plasmids designed for cloning 
Taq-amplif ied PCR products (e.g., TOPO™ TA Cloning* 

Kit, PCR2 . 1 plasmid derivatives, Invitrogen, Carlsbad, 
CA) , and mammalian, yeast, or virus vectors such as a 
baculovirus expression system (pBacPAK plasmid 
20 derivatives, Clontech, Palo Alto, CA) . 

After the vector has been constructed and a 
nucleic acid molecule encoding a C3b/C4b CR-like 
polypeptide has been inserted into the proper site of 
the vector, the completed vector may be inserted into a 

25 suitable host cell for amplification and/or polypeptide 
expression. The transformation of an expression vector 
for a C3b/C4b CR-like polypeptide into a selected host 
cell may be accomplished by well known methods 
including methods such as transf ection, infection, 

30 calcium chloride, electroporation, microinjection, 
lipofection or the DEAE-dextran method or other known 
techniques. The method selected will in part be a 
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'function of the type of host cell to be used. These 
methods and other suitable methods are well known to 
the skilled artisan, and are set forth, for example, in 
' Sambrook et al., supra. 

'5 Host cells may be prokaryotic host cells (such as 

<E. coli) or eukaryotic host cells (such as a yeast 
cell, an insect cell or a vertebrate cell). The host 
cell, when cultured under appropriate conditions, 
synthesizes a C3b/C4b CR-like polypeptide which can 

10 subsequently be collected from. the culture medium (if 
the host cell secretes it into the medium) or directly 
from, the host cell producing it (if it is not 
secreted) . The selection of an appropriate host cell 
will depend upon various, factors, such as desired 

15 expression levels, polypeptide modifications that are 
desirable . or necessary for activity, such as 
glycosylation or phosphorylation, and ease of folding 
into a biologically active molecule. 

A number of suitable host cells are known in the 
•20 art and many are available from the American Type 
Culture Collection (ATCC) , 10801 University Boulevard, 
Manassas, VA 20110-2209. Examples include, but are 
not limited to, mammalian cells, such as Chinese 
hamster ovary cells (CHO) (ATCC No. CCL61) CHO DHFR- 

25 cells (Urlaub et al . , Proc. Natl. Acad. Scl. USA, 
97 :4216-4220 " (1980) ) , human embryonic kidney (HEK) 293 
or 293T cells (ATCC No. CRL1573) , or 3T3 cells (ATCC 
No. CCL92) . The selection of suitable mammalian host 
cells and methods for transformation, culture, 

30 amplification, screening and product production and 
purification are known in the art. Other suitable 
mammalian cell lines, are the monkey COS-1 (ATCC No. 
CRL1650) and COS-7 cell lines (ATCC No. CRL1651.) , and 
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the CV-1 cell line (ATCC.No. CCL70) . Further exemplary 
mammalian host cells include primate cell lines and 
rodent cell lines, including transformed cell lines. 
Normal diploid cells, cell strains derived from in 
5 vitro culture of primary tissue, as well as primary 
explants, are also suitable. Candidate cells may be 
genotypically deficient in the selection gene, or may 
contain a dominantly acting selection gene. Other 
suitable mammalian cell lines include but are not 

10 limited to, mouse neuroblastoma N2A cells, HeLa, mouse 
L-929 cells, 3T3 . lines derived from Swiss, Balb-c or 
NIH mice, BHK or HaK hamster cell lines, which are 
available from the ATCC. Each of these cell lines is 
known by and available to those skilled in the art of 

15 protein expression. 

Similarly useful as host cells suitable for the 
present invention are bacterial cells. For example, 
the various strains of E m coli (e.g., HB101, (ATCC No. 
33694) DH5a, DH10, and MC1061 (ATCC No, 53338)) are 
20 well-known as host cells in the field of biotechnology. 
Various strains of B. subtilis, Pseudomonas spp. , other 
Bacillus spp., Strep tomyces spp., and the like may also 
be employed in this method. 

Many strains of yeast cells known to those skilled 
25 in the art are also available as host cells for the 
expression of the polypeptides of the present 
invention. Preferred yeast cells include, for example, 
Saccharomyces cerivisae and Pichia pastoris. 

Additionally, where desired, insect cell systems 
30 may be utilized in the methods of the present 
invention. Such systems are described for example in 
Kitts et al. t Biotechniques, 14:810-817 (1993); 
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'Lucklow, Curr. Opin. Biotechnol., £:564-572 (1993); and 
Lucklow et al. (J*. Virol., 67:4566-4579 (1993). 
Preferred insect cells are Sf-9 and Hi5 (Invitrogen, 

'Carlsbad, CA) . 

5 One may also use transgenic animals to express 

glycosylated C3b/C4b CR-like polypeptides.. For 
^example, one may use a transgenic mi lk -producing animal 
(a cow or goat, for example) and obtain the present 
glycosylated polypeptide in the animal milk. One may 

10 also use plants to produce C3b/C4b CR-like 
polypeptides, however, in general, the glycosylation 
occurring in plants is different from that produced in 
mammalian cells, and may result in a glycosylated 
product which is not suitable for human therapeutic 

15 use. 

Polypeptide Production 

. Host cells comprising a C3b/C4b CR-like 
polypeptide expression vector may be cultured using 

20 standard media well known to the skilled artisan. The 
media will usually contain all nutrients necessary for 
the growth and survival of the cells. Suitable media 
for culturing E. coli cells include, for example, Luria 
Broth (LB) and/or Terrific Broth (TB) . Suitable media 

25 for culturing eukaryotic cells include Roswell Park 
Memorial Institute medium 1640 (RPMI 1640) , Minimal 
Essential Medium (MEM) and/or Dulbecco f s Modified Eagle 
Medium (DMEM) , all of which may be supplemented with 
serum and/or growth factors as indicated by the 

30 particular cell line being cultured. A suitable medium 
for insect cultures is Grace's medium supplemented with 



61 



WO 02/10199 



PCT/US01/23232 



'yeastolate, lactalbumin hydrolysate and/or fetal calf 
serum, as necessary. 

Typically, an antibiotic or other compound useful 
Z] for selective growth of transformed cells is added as a 
5 supplement to the media. The compound to be used will 
be dictated by the selectable marker element present on 
, the plasmid with which the host cell was transformed. 
For example, whex^e the selectable marker element is 
kanamycin resistance, the compound added to the culture 
10 medium will be kanamycin. Other compounds for 

selective growth include ampici.llin, tetracycline, and 
neomycin. 

The amount of a C3b/C4b CR-like polypeptide 
produced by a host cell can be evaluated using standard 

15 methods known in the art. Such methods include, 
without limitation, Western blot analysis, SDS- 
poly aery 1 amide gel electrophoresis, non- denaturing gel 
electrophoresis, HPLC separation, immunoprecipi tat ion, 
and/or activity assays such as DNA binding gel shift 

2 0 assays. 

If a C3b/C4b CR-like polypeptide has been designed 
to be secreted from the host cells, the majority , of 
polypeptide may be found in the cell culture medium. 
If however, the C3b/C4b CR-like polypeptide is not 
25 secreted from the host cells, it will be present in the 
cytoplasm and/or the nucleus (for eukaryotic host 
cells) or in the cytosol (for bacterial host cells) . 

For a C3b/C4b CR-like polypeptide situated in the 
host cell cytoplasm and/or the nucleus (for eukaryotic 
30 host cells) or in the cytosol (for bacterial host 
cells) , intracellular material (including inclusion 
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bodies for gram- negative bacteria) can be extracted 
from the host cell using any standard technique known 
to the skilled artisan. For example, the host cells 
can be lysed to release the contents of the 
5 periplasm/ cytoplasm by French press, homogenization, 
and/or sonication followed by centrif ugation . 

If a C3b/C4b CR-like polypeptide has formed 
inclusion bodies in the cytosol, the inclusion bodies 
can often bind to the inner and/or outer cellular 

10 membranes and thus will be found primarily in the 
pellet material after cent rifugat ion . The pellet 
material can then be treated at pH extremes or with a 
chaotropic agent such as a detergent, guanidine, 
guanidme derivatives, urea, or urea derivatives in the 

15 presence of a reducing agent such as dithiothreitol * at 
alkaline pH or tris carboxyethyl phosphine at acid pH 
to release, break apart, and solubilize the inclusion 
bodies. The C3b/C4b CR-like polypeptide in its now 
soluble form can then be analyzed using gel 

20 electrophoresis, immunoprecipitation or the like. • If 
it is desired to isolate the C3b/C4b CR-like 
polypept ide, isolation may be accomplished using 
standard methods such as those described herein and in 
Mars'ton et al., Meth. Enz., 182 :264-275 (1990). 

25 In some cases, a C3b/C4b CR-like polypeptide may 

not be biologically active upon isolation. Various 
methods for "refolding" or converting the polypeptide 
to its tertiary structure and generating disulfide 
linkages can be used to restore biological activity. 

3 0 Such methods include exposing the solubilized 
polypeptide to a pH usually above 7 and in the presence 
of a particular concentration of a chaotrope. The 
selection of chaotrope is very similar to the choices 
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used for inclusion body solubilization, but usually the 
chaotrope is used at a lower concentration and is not 
necessarily the same as chaotropes used for the 
solubilization. In most cases the refolding/oxidation 
5 solution will also contain a reducing agent or the 
reducing agent plus its oxidized form in a specific 
ratio to generate a particular redox potential allowing 
for disulfide shuffling , to occur in the formation of 
the protein's cysteine bridge (s) . Some of the commonly 

10 used redox couples include cysteine/cystamine, 
glutathione (GSH) /dithiobis GSH, cupric chloride, 
dithiothreitol (DTT) / dithiane DTT, and 2- 

2mercaptoethanol (bME) /dithio-b (ME) . A cosolvent may be 
used to increase the efficiency of the refolding, and 

15 the more common reagents used for this purpose include 
glycerol, polyethylene glycol of various molecular 
weights, arginine and the like. 

• If inclusion bodies are not formed to a 
significant degree upon expression of a C3b/C4b CR-like 
20 polypeptide, then the polypeptide will be found 
primarily in the supernatant after centrif ugation of 
the cell homogenate. The polypeptide may be further 
isolated from the supernatant using methods such as 
those described herein. 

25 The purification of a C3b/C4b CR-like polypeptide 

from solution can be accomplished using a variety of 
techniques. If the polypeptide has been synthesized 
such that it contains a tag such as Hexahistidine 
(C3b/C4b CR-like polypeptide/hexaHis) or other small 

30 peptide such as FLAG (Eastman Kodak Co., New Haven, CT) 
or myc (Invitrogen, Carlsbad, CA) at either its 
carboxyl or amino terminus, it may be purified in a 
one- step process by passing the solution through an 
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'affinity column where the column matrix has a high 
affinity for the tag. 

For example, polyhistidine binds with great 
affinity and specificity to nickel, thus an affinity 
5 column of nickel (such as the Qiagen* nickel columns) 
can be used for purification of C3b/C4b CR-like 
polypeptide/polyHis . See for example, Ausubel et al . , 
eds., Current Protocols in Molecular Biology, Section 
10.11.8, John Wiley & Sons, New York (1993). 

10 . Additionally, the C3B/C4B CR-like polypeptide may 

be purified through the use of a monoclonal antibody 
which is capable of specifically recognizing and 
binding to the C3B/C4B CR-like polypeptide. 

. . ' Suitable procedures for purification thus include, 
15 without limitation, affinity chromatography, 

immunoaf f inity chromatography, ? on exchange 

chromatography, molecular sieve chromatography, High 
Performance Liquid Chromatography (HPLC) , 

electrophoresis (including native gel electrophoresis) 
20 followed by gel elution, and preparative isoelectric 
focusing ("Isoprime" machine/technique, Hoefer 
Scientific, San Francisco, CA) . In some cases, two or 
more purification techniques may be combined to achieve 
increased purity. 

25 C3b/C4b CR-like polypeptides may also be prepared 

by chemical synthesis methods (such as solid phase 
peptide synthesis) using techniques known in the art, 
such as those set forth by Merrifield et al . , J". Am. 
Chem. Soc, £5:2149 (1963), Houghten et al . , Proc Natl 

30 Acad. Sci. USA, 82:5132 (1985), and Stewart and Young, 
Solid Phase Peptide Synthesis, Pierce Chemical Co., 
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{ Rockford, IL (1984) . 



Such polypeptides may be 



synthesized with or without a methionine on the amino 



'polypeptides may be oxidized using methods set forth in 
-5 these references to form disulfide bridges. Chemically 
synthesized C3b/C4b CR-like polypeptides are expected 
to have comparable biological activity to the 
corresponding C3b/C4b CR-like polypeptides produced 
recombinant ly or purified from natural sources, and 
10 thus may be used interchangeably with a recombinant or 
natural C3b/C4b CR-like polypeptide. 

Another means of obtaining a C3b/C4b CR-like 
polypeptide is via purification from biological samples 
such as source tissues and/or fluids in which the 

15 C3b/C4b CR-like polypeptide is naturally found. Such 
purification can be conducted using methods fox* protein 
purification as described herein. The presence of the 
C3b/C4b CR-like polypeptide during purification may be 
monitored using, for example/ an antibody prepared 

2 0 against recombinant ly produced C3b/C4b CR-like 
polypeptide or peptide fragments thereof. 

A number of additional methods for producing 
nucleic acids and polypeptides are known in the art, 
and can be used to produce polypeptides having 

25 specificity for C3b/C4b CR-like. See for example, 
Roberts et al., Proc. Natl. Acad. Sci . t £4:12297-12303 
(1997) , which describes the production of fusion 
proteins between an mRNA arid its encoded peptide. See 
also Roberts, R. , Curr. Opin. Chem. Biol., 2 :268 ~ 273 

30 (1999). Additionally, U.S. patent No. 5,824,469 

describes methods of obtaining oligonucleotides capable 
of carrying out a specific biological function. The 



terminus . 



Chemically synthesized C3b/C4b CR-like 
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procedure involves generating a heterogeneous pool of 
oligonucleotides, each having a 5' randomized sequence, 
a central preselected sequence, and a 3' randomized 
sequence. The resulting heterogeneous pool is 

5 introduced into a population of cells that do not 
exhibit the desired biological function. 

Subpopulations of the cells are then screened ' for those 
which exhibit a predetermined biological function. 
From that subpopulation , oligonucleotides capable of 
10 carrying out the desired biological . function are 
isolated. 

U.S. Patent Nos. 5,763,192, 5,814,476, 5,723,323, 
and 5,817,483 describe processes for producing peptides 
or polypeptides. This is done by producing stochastic 

15 genes or fragments thereof, and then introducing these 
genes into host cells which produce one or more 
proteins encoded by the stochastic genes. The host 
cells are then screened to identify those clones 
producing peptides or polypeptides having the desired 

20 activity. 

Chemical Derivatives 

Chemically modified derivatives of the C3b/C4b CR- 
like polypeptides may be prepared by one skilled in the 
art, given the disclosures set forth hereinbelow. 

25 C3b/C4b CR-like polypeptide derivatives are modified in 
a manner, that is different, either in the type or 
location of the molecules naturally attached to the 
polypeptide. Derivatives may include molecules formed 
by the deletion of one or more naturally-attached 

30 chemical groups. The polypeptide comprising the amino 
acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 
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J NO:7, or a C3b/C4b CR-like polypeptide variant may be 
modified by the covalent attachment of one or more 
polymers- For example, the polymer selected is 

' typically water soluble so that the protein to which it 

'* 5 is attached does not precipitate in an aqueous 
environment, such as a physiological environment. 
Included within the scope of suitable polymers is a 
mixture of polymers. Preferably, for therapeutic use 
of the end-product preparation, the polymer will be 

10 pharmaceutically acceptable. 

The polymers each may be of any molecular weight 
* and may be branched or unbranched. The polymers each 
typically have an average molecular weight of between 
about 2kDa to about lOOkDa (the term "about" indicating 

15 that in preparations of a water soluble polymer, some 
molecules will weigh more, some less, than the stated 
molecular weight) The average molecular weight of 

each polymer preferably is between about 5kDa and about 
SOkDa; more preferably between about 12kDa and about 

2 0 4 0kDa and most preferably between about 2 0kDa and about 
35kDa. 

Suitable water soluble polymers or mixtures 
thereof include, but are not limited to, N-linked or O- 
1 inked carbohydrates, sugars, phosphates, polyethylene 

25 glycol (PEG)- (including the forms of PEG that have been 
used to derivatize proteins, including mono-(Ci-Cio) 
alkoxy- or aryloxy-polyethylene glycol) , monomethoxy- 
polyethylene glycol, dextran (such as low molecular 
weight dextran, of, for example about 6 kD) , cellulose, 

30 or other carbohydrate based polymers, poly- (N-vinyl 
pyrrolidone) polyethylene glycol, propylene glycol 
homopolymers , a polypropylene oxide/ethyl ene oxide co- 
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polymer, polyoxyethylated polyols (e.g., glycerol) and 
polyvinyl alcohol- Also encompassed by the present 
invention are bifunctional crosslinking molecules which 
may be used to prepare covalently attached multimers of 
5 the polypeptide . comprising the amino acid sequence of 
SEQ ID NO: 2, SEQ ID NO : 4 , or SEQ ID NO : 7 , or a C3b/C4b 
CR-like polypeptide variant. 

In general, chemical dexivatization may be 
performed under any suitable condition used to react . a 

10 protein with an activated polymer molecule. Methods 
for preparing chemical derivatives of polypeptides will 
generally comprise the steps of (a) reacting the 
polypeptide with the activated polymer molecule (such 
as a reactive ester or aldehyde derivative of the 

15 polymer molecule) under conditions whereby the 
polypeptide comprising the amino acid sequence of SEQ 
ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7, or a C3b/C4b CR- 
like- polypeptide variant becomes attached to one or 
more polymer molecules, and (b) obtaining the reaction 

20 product (s). The optimal reaction conditions will be 
determined based on known parameters and the desired 
result. For example, the larger the ratio of polymer 
molecules :protein, the greater the percentage of 
attached polymer molecule. In one embodiment, the 

25 C3b/C4b CR-like polypeptide derivative may have a 
single polymer molecule moiety at the amino terminus. 
See, for example, U.S. Patent No. 5,234,784. 

The pegylation of the polypeptide specifically may 
be carried out by any of the pegylation reactions known 
30 in the art, as described for example in the following 
references: Francis et al . , Focus on Growth Factors, 
3:4-10 (1992); EP 0154316; EP 0401384 and U.S. Patent 
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No. 4,179,337. For example, pegylation may be carried 
out via an acylation reaction or an alkylation reaction 
with a reactive polyethylene glycol molecule (or an 
analogous reactive water-soluble polymer) as described 
5 herein. For the acylation reactions, the polymer (s) 
selected should have a single reactive ester group. 
For reductive alkylation, the polymer (s) selected 
should have a single reactive aldehyde group. A 
reactive aldehyde is, for example, polyethylene glycol 
10 propionaldehyde, which is water stable, or mono Ci-Ci 0 
alkoxy or aryloxy derivatives thereof (see U.S. Patent 
No. 5,252,714) . 

In another embodiment, C3b/C4b CR-like 
polypeptides may be chemically coupled to biotin, and 

15 the biotin/C3b/C4b CR-like polypeptide molecules which 
are conjugated are then allowed to bind to avidin, 
resulting in tetravalent avidin/biotin/C3b/C4b CR-like 
polypeptide molecules. C3b/C4b CR-like polypeptides 
may also be covalently coupled to dinitrophenol (DNP) 

20 or trinitrophenol (TNP) and the resulting conjugates 
precipitated with ant i -DNP or anti-TNP-IgM to form 
decameric conjugates with a valency of 10. 

Generally, conditions which may be alleviated or 
modulated by the administration of the present C3b/C4b 

25 CR-like polypeptide derivatives include those described 
herein for C3b/C4b CR-like polypeptides. However, the 
C3b/C4b CR-like polypeptide derivatives disclosed 
herein may have additional activities, enhanced or 
reduced biological activity, or other characteristics, 

30 such as increased or decreased half- life, as compared 
to the non-derivatized molecules. 
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^ Genetically Engineered Non-Human Animals 

Additionally included within the scope of the 
present invention are non-human animals such as mice, 
'rats, or other rodents, rabbits, goats, or sheep, or 
5 other farm animals, in which the gene (or genes) 
encoding the native C3b/C4b CR-like polypeptide has 
(have) been disrupted ("knocked but") such that the 
"level of expression of this gene or genes is (are) 
significantly decreased or completely abolished. Such 
10 animals may be prepared* using techniques and methods 
such as those described in U.S. Patent No. 5,557,032. 

The present invention further includes non-human 
animals such as mice, rats, or other rodents, rabbits, 
goats, sheep, or other farm animals, in which either 

15 the native form of the C3b/C4b CR-like gene(s) for that 
animal or a heterologous C3b/C4b CR-like gene(s) is 
(are) over- expressed by the animal, thereby creating a 
"transgenic" animal. Such transgenic animals may be 
prepared using well known methods such as those 

20 described in U.S. Patent No 5,4 89,743 and PCT 
application No. W094/28122. 

The present invention further includes non-human 
animals in which the promoter for one or more of the 
C3b/C4b CR-like polypeptides of the present invention 
25 is either activated or inactivated (e.g. , by using 
homologous recombination methods) to alter the level of 
expression of one or more of the native C3b/C4b CR-like 
polypeptides . 

These non-human animals may be used for drug 
3 0 candidate screening. In such screening, the impact of 
a drug candidate on the animal may be measured. For 
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'' example , drug candidates may decrease or increase the 
expression of the C3b/C4b CR-like gene. In certain 
embodiments, the amount of C3b/C4b CR-like polypeptide, 

'that is produced may be measured after the exposure of 
— 5 the animal to the drug candidate. Additionally, in 
certain embodiments, one may detect the actual impact 
of the drug candidate on the animal. For example, the 

' ove r express ion of a particular gene may result in, or 
be associated with, a disease or pathological 
10 condition. In such cases, one may test a drug 

candidate's ability to decrease expression of the gene 
or its ability to prevent or inhibit a pathological 
condition. In other examples, the production of a ■ 
particular metabolic product such as a fragment of a 
15 polypeptide,, may result in, or be associated with, a 
disease or pathological condition. In such cases, one 
may test a drug candidate's ability to decrease the 
production of such a metabolic product or its ability 
to prevent or inhibit a pathological condition. 

20 

Microarray 

It will be appreciated that DNA microarray 
technology can be utilized in accordance with the 
present invention. DNA microarrays are miniature, high 

2 5 density arrays of nucleic acids positioned on a solid 
support, such as glass. Each cell or element within 
the array has numerous copies of a single species of 
DNA which acts as a target for hybridization for its 
cognate mRNA. In expression profiling using DNA 

30 microarray technology, mRNA is first extracted from a 
cell or tissue sample and then converted enzymatically 
to f luorescently labeled cDNA . This material is 
hybridized to the microarray and unbound cDNA is 
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removed by washing. The expression of discrete genes 
represented on the array is then visualized by 
quant i t at ing the amount of labeled cDNA which is 
specifically bound to each target DNA. In this way, 
5 the expression of thousands of genes can be quantitated 
in a high throughput, parallel manner from a single 
sample of biological material . 

This high throughput expression profiling has a 
broad range of applications with respect to the C3b/C4b 

10 CR-like molecules of the invention, including, but not 
limited to: the identification and validation of 
C3b/C4b CR-like disease-relatedi genes as targets for 
therapeutics; molecular toxicology of C3b/C4b CR-like 
molecules and inhibitors thereof; stratification of 

15 populations and generation of surrogate markers for 
clinical trials; and enhancing C3b/C4b CR-like-related 
small molecule drug discovery by aiding in the 
identification of selective compounds in high 
throughput screens (HTS) . 

20 

Selective Binding Agents 

As used herein, the term "selective binding agent" 
refers to a molecule which has specificity for one or 
more C3b/C4b CR-like polypeptides. Suitable selective 

25 binding agents include, but are not limited to, 
antibodies and derivatives thereof, polypeptides, and 
small molecules. Suitable selective binding agents may 
be prepared using methods known in the art. An 
exemplary C3B/C4B CR-like polypeptide selective binding 

30 agent of the present invention is capable of binding a 
certain portion of the C3B/C4B CR-like polypeptide 
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thereby inhibiting the binding of the polypeptide to 
the C3B/C4B CR-like polypeptide receptor (s) . 

Selective binding agents such as antibodies and 
antibody fragments that bind C3b/C4b CR-like 
5 polypeptides are within the scope of the present 
invention. The antibodies may be polyclonal including 
monospecific polyclonal, monoclonal (MAbs) , 

recombinant, chimeric, humanized such as GDR-gcaf ted, 
human, single chain, and/or bispecific, as well as 

10 fragments, variants or derivatives thereof. Antibody 
fragments include those portions of the antibody which 
bind to an epitope on the C3B/C4B CR-like polypeptide. 
Examples of such fragments include Fab and F(ab') 
fragments generated by enzymatic cleavage of full- 

15 length antibodies. Other binding fragments include 
those generated by recombinant DNA techniques, such as 
the expression of recombinant plasmids containing 
nucleic acid sequences encoding antibody variable 
regions. 

2 0 Polyclonal antibodies, directed toward a C3b/C4b 

CR-like polypeptide generally are produced in animals 
(e.gr., rabb.its or mice) by means of multiple 
subcutaneous or intraperitoneal injections of C3b/C4b 
CR-like polypeptide and an adjuvant. It may be useful 
25 to conjugate a C3b/C4b CR-like polypeptide to a carrier 
protein that is immunogenic in the species to be 
immunized, such as keyhole limpet heocyanin, serum, 
albumin, bovine thyroglobulin, or soybean trypsin 
inhibitor. Also, aggregating agents such as alum are 

3 0 used to enhance the immune response. After 

immunization, the animals are bled and the serum is 
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: ~ ' assayed for anti-C3b/C4b CR-like polypeptide antibody 
titer. 

Monoclonal antibodies directed toward a C3b/C4b 
'CR-like polypeptide are produced using any method which 
* 5 provides for the production of antibody molecules by 
continuous cell lines in culture. Examples of suitable 
methods for preparing monoclonal antibodies include the 
hybridoma methods of Kohler et al . , Nature,. 2^56:495-497 
(1975) and the human B-cell hybridoma method / Kozbor, 
10 J\ Immunol., 133 : 3001 (1984); Brodeur et al . , 
Monoclonal Antibody Production Techniques and 
Applications, pp. 51-63 (Marcel Dekker, Inc., New York, 
1987) . Also provided by the invention are hybridoma 
cell lines which produce monoclonal antibodies reactive 
15 with C3b/C4b CR-like polypeptides. 

Monoclonal antibodies of the invention may ' be 
modified for use as therapeutics. One embodiment is a 
"chimeric" antibody in which a portion of the heavy 
and/or light chain is identical with or homologous to a 

20 corresponding sequence in antibodies derived from a 
particular species or belonging to a particular 
antibody class or subclass, while the remainder of the 
chain (s) is identical with or homologous to a 
corresponding sequence in antibodies derived from 

25 another species or belonging to another antibody class 
or subclass. Also included are fragments of such 
antibodies, so long as they exhibit the desired 
biological activity. See, U.S. Patent No. 4,816,567; 
Morrison et al . , Proc. Natl. Acad. Sci . , 81 : 6851-6855 

30 (1985) . 

In another embodiment, a monoclonal antibody of 
the invention is a "humanized" antibody. Methods for 
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humanizing non-human antibodies are well known in the 
art. See U.S. Patent Nos. 5,585,089, and 5,693,762. 
Generally, a humanized antibody has one or more amino 
acid residues introduced into it from a source which is 
5 non-human. Humanization can be performed, for example, 
using methods described in the art (Jones et al . , 
Nature 321 : 522-525 (1986); Riechmann et al . , Nature, 
332:323-327 (1988); Verhoeyen et al . , Science 239:1534- 
1536 (1988)), by substituting at least a portion of a 
10 rodent complementarity-determining region (CDR) for the 
corresponding regions of a human antibody. 

Also encompassed by the invention are human 
antibodies which bind C3b/C4b CR-like polypeptides. 
Using transgenic animals (e.gr., mice) that are capable 

15 of producing a repertoire of human ancibodies in the 
absence of endogenous immunoglobulin production such 
antibodies are produced by immunization with a C3b/C4b 
CR-like antigen {i.e., having at least 6 contiguous 
amino acids) , optionally conjugated to a carrier. See, 

20 for example, Jakobovits et al . , Proc. Natl. Acad. Sci . , 
90:2551-2555 (1993); Jakobovits et al . , Nature 362 :255- 
258 (1993); Bruggemann et al . , Year in Immune, 7:33 
(1993) . In one method, such transgenic animals are 
produced by incapacitating the endogenous loci encoding 

25 the heavy and light immunoglobulin chains therein, and 
inserting loci encoding human heavy and light chain 
proteins into the genome thereof. Partially modified 
animals, that is those having less than the full 
complement of modifications, are then cross-bred to 

3 0 obtain an animal having all of the desired immune 
system modifications. When administered an immunogen, 
these transgenic animals produce antibodies with human 
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~ ' (rather than e.g., murine) amino acid sequences, 
including variable regions which are immunospecif ic for 
these antigens. See PCT application nos. 

'PCT/US96/05928 and PCT/US93/06926 . Additional methods 
-5 are described in U.S. Patent No. 5,545,807, PCT 
application nos. PCT/US91/245 /• PCT/GB89/01207 , and in 
EP 546073B1 and EP 546073A1. Human antibodies may also 
^be produced by the expression of recombinant DNA in 
host cells or by expression in hybridoma cells as 
10 described herein. 

In an alternative embodiment, human antibodies can 
be produced from phage-display libraries (Hoogenboom et 
al., J". Mol. Biol. 227 :381 (1991); Marks et aJ * , i7. 
Mol.. Biol: 222:581 (1991). . These, processes mimic 

15 immune selection through the display of ' antibody 
repertoires on the surface of filamentous 
bacteriophage, and subsequent selection of phage by 
their binding to an antigen of choice. One such 
technique is described in PCT Application no. 

2.0 PCT/US98/17364 , which describes the isolation of high 
affinity and functional agonistic antibodies for MPL>- 
and msk- receptors using such an approach. 

Chimeric, CDR grafted, and humanized antibodies 
are typically produced by recombinant methods. Nucleic 

25 acids encoding the antibodies are introduced into host 
cells and expressed using materials and procedures 
described herein. In a preferred embodiment, the 
antibodies are produced in mammalian host cells, such 
as CHO cells. Monoclonal (e.g., human) antibodies may 

3 0 be produced by the expression of recombinant DNA in 
host cells or by expression in hybridoma cells as 
described herein. 
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' The anti-C3b/C4b CR-like antibodies cf the 

invention may be employed in any known assay method, 
such as competitive binding assays, direct and indirect 

'sandwich assays, and immunoprecipitation assays (Zola, 
-5 Monoclonal Antibodies: A Manual of Techniques, pp. 
147-158 (CRC Press, Inc., 1987)) for the detection and 
quantitation of C3b/C4b CR-like polypeptides. The 

•antibodies will bind C3b/C4b CR-like polypeptides with 
an affinity which is appropriate for the assay method 
10 being employed. 

For diagnostic applications, in certain 
embodiments, anti-C3b/C4b CR-like antibodies may be 
labeled with a detectable moiety. The detectable 
moiety can be any one which is capable of producing, 

15 either directly or indirectly, a detectable signal. 
For example, the detectable moiety may be a 
radioisotope, such as . 3 H, 14 C, 32 P, 35 S, or 125 I, a 
. f lucrescent or chemiluminescent compound, such as 
fluorescein isothiocyanate , rhodamine, or luciferin;. or 

20 an ..enzyme, such as. alkaline phosphatase, , 

galactosidase, or horseradish peroxidase (Bayer et al . , 
Meth. Enz. t 184 :138-163 (1990)). 

Competitive binding assays rely on the ability of 
a labeled standard (e.g., a C3b/C4b CR-like 

25 polypeptide,' or an immunologically reactive portion 
thereof) to compete with the test sample analyte (an 
C3b/C4b CR-like polypeptide) for binding with a limited 
amount of anti C3b/C4b CR-like antibody. The amount of 
a C3b/C4b CR-like polypeptide in the test sample is 

30 inversely proportional to the amount of standard that 
becomes bound to the antibodies. To facilitate 

determining the amount of standard that becomes bound, 
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the antibodies typically are insolubilized before or 
after the competition, so that the standard and analyte 
that are bound to the antibodies may conveniently be 
separated from the standard and analyte which remain 
5 unbound. 

Sandwich assays typically involve the use of two 
antibodies, each capable of binding to a different 
immunogenic portion, or epitope, of the protein u.o be 
detected and/or quant itated. In a sandwich assay, the 

10 test sample analyte is typically bound by a first 
antibody which is immobilized on a solid support, and 
thereafter a second antibody binds to the analyte, thus 
forming an insoluble three part complex. See, e.g. , 
U.S. Patent No. 4,3 76,110. The second antibody may 

15 itself be labeled with a detectable moiety (direct 
sandwich assays) or may be measured using an anti- 
immunoglobulin antibody that is labeled with a 
detectable moiety (indirect sandwich .assays) . For 
example, one type of sandwich .assay is an enzyme- linked 

20 immunosorbent assay (ELISA) , in . which case the 
detectable moiety is an enzyme. 

The selective binding agents, including anti- 
C3b/C4b CR-like antibodies, also are useful for in vivo 
imaging. An antibody labeled with a detectable moiety 

25 may be administered to an animal, preferably into the 
bloodstream, and the presence and location of the 
labeled antibody in the host is assayed. The antibody 
may be labeled with any moiety that is detectable . in an 
animal, whether by nuclear magnetic resonance, 

30 radiology, or other detection means known in the art. 
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Selective binding agents of the invention, 
including antibodies, may be used as therapeutics. 
These therapeutic agents are generally agonists or 
antagonists, in that they either enhance or reduce, 
5 respectively, at least one of the biological activities 
of a C3b/C4b CR-like polypeptide. In one embodiment, 
antagonist antibodies of the invention are antibodies 
or binding fragments thereof which are capable of 
specifically binding to a C3b/C4b CR-like polypeptide 

10 and which are capable of inhibiting or eliminating the 
functional activity of a C3b/C4b CR-like polypeptide in 
vivo or in vitro. In preferred embodiments, the 
selective binding agent, e.g. , an antagonist antibody, 
will inhibit the functional activity of a C3b/C4b CR- 

15 like polypeptide by at least about 50%, and preferably 
by at least about 80%. In another embodiment, the 
selective binding agent may be an antibody that is 
capable of interacting with a C3b/G4b CR-like binding 
partner (a ligand or receptor) thereby inhibiting or 

20 eliminating C3b/C4b CR-like activity in vitro or in 
vivo. Selective binding agents, including agonist and 
antagonist anti-C3b/C4b CR-like antibodies, are 
identified by screening assays which axe well known in 
the art . 

25 The invention also relates to a kit comprising 

C3b/C4b CR-like selective binding agents (such as 
antibodies) and other reagents useful for detecting 
C3b/C4b CR-like polypeptide levels in biological 
samples. Such reagents may include, a detectable 

30 label, blocking serum, positive and negative control 
samples, and detection reagents. 

C3b/C4b CR-like polypeptides can be used to clone 
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J C3b/C4b CR-like ligand(s) using an "expression cloning" 
strategy. Radiolabeled (125-Iodine) C3b/C4b CR-like 
polypeptide or "affinity/activity-tagged" C3b/C4b CR- 
. like polypeptide (such as an Fc fusion or an alkaline 
. 5 phosphatase fusion) can be used in binding assays to 
identify a cell type or cell line or tissue that 
expresses C3b/C4b CR-like ligand(s) . RNA isolated from 
<■ such cells or tissues can then be converted to cDNA, 
cloned into a mammalian . expression vector, and 
10 transfected into mammalian cells (for example, COS,, or 
293) to create an expression library. Radiolabeled- or 
tagged C3b/C4b CR-like polypeptide can then be used as 
an affinity reagent to identify and isolate the subset 
of cells in this library expressing C3b/C4b CR-like 
15 ligand (s) . DNA is then isolated from these cells and 
transfected into mammalian cells to create a secondary 
expression library in which the fraction of cells 
expressing C3b/C4b CR-like ligand(s) woyld be many-fold 
higher than in the original library. This enrichment 

2 0 process can be repeated iteratively . until a single 

recombinant clone containing a C3b/C4b CR-like ligand 
is isolated. Isolation of C3b/C4b CR-like ligand (s) is 
useful for identifying or developing novel agonists and 
antagonists of the C3b/C4b CR-like signaling pathway. 
25 Such agonists and antagonists include C3b/C4b CR-like 
ligand(s), .anti-C3b/C4b* CR-like ligand antibodies, 
small molecules or antisense oligonucleotides. 

Assaying for other modulators of C3b/C4b CR-like 

3 0 Polypeptide activity 

In some situations, it may be desirable to 
identify molecules that are modulators, i.e., agonists 
or antagonists, of the activity of C3b/C4b CR-like 
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'polypeptide. Natural . or synthetic molecules that 
modulate C3b/C4b CR-like polypeptide may be identified 
using one or more screening assays, such as those 
'described herein. Such molecules may be administered 
■5 either in an ex vivo manner, or in an in vivo manner by 
injection, or by oral delivery, implantation device, or 
the like. 

"Test molecule (s) " refers to the molecule (s) that 
is/are under evaluation for the ability to modulate 

10 (i.e., increase or decrease) the activity of a C3b/C4b 
CR-like polypeptide. Most commonly, a test molecule 
will interact directly with a C3b/C4b CR-like 
polypeptide. However, it is also contemplated that a 
test molecule may , also modulate C3b/C4b CR-like 

15 polypeptide activity indirectly, such as by affecting 
C3b/C4b CR-like gene expression, or by binding to a 
C3b/C4b CR-like binding partner (e.g., receptor or 
ligand) . In one embodiment, a test molecule will bind 
to a C3b/C4b CR-like polypeptide with an affinity 

20 constant of at least about 10~ 6 M, preferably about 10*" 8 
M, more preferably about 10" 9 M, and even more 
preferably about 10* 10 M. 

Methods for identifying compounds which interact 
with C3b/C4b CR-like polypeptides are encompassed by 

2 5 the present* invention. In certain embodiments, a 
C3b/C4b CR-like polypeptide is incubated with a test 
molecule under conditions which permit the interaction 
of the test molecule with a C3b/C4b CR-like 
polypeptide, and the extent of the interaction can be 

30 measured. The test molecule (s) can be screened in a 
substantially purified form or in a crude mixture. 
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In certain embodiments, a C3b/C4b CR-like 
polypeptide agonist or antagonist may be a protein, 
peptide, carbohydrate, lipid, or small molecular weight 
molecule which interacts with C3b/C4b CR-like 
5 polypeptide to regulate its activity. Molecules which 
regulate C3b/C4b CR-like polypeptide expression include 
nucleic acids which are complementary to nucleic acids 
encoding a C3b/C4b CR-like polypeptide, or are 
complementary to nucleic acids sequences which direct 
10 or control the expression of C3b/C4b CR-like 
polypeptide, and which act as anti- sense regulators of 
expression. 

Once a set. of test, molecules has been identified 
as interacting with a C3b/C4b CR-like polypeptide, the 

15 molecules may be furthei* * evaluated for their ability to 
increase or decrease C3b/C4b CR-like polypeptide 
activity. The measurement of the interaction of test 
molecules with C3b/C4b CR-like polypeptides may be 
carried out in several formats, including cell -based 

2 0 binding assays, membrane binding assays, solution-phase 
assays and immunoassays. In general, test molecules 
are incubated with a C3b/C4b CR-like polypeptide for a 
specified period of time, and C3b/C4b CR-like 
polypeptide activity is determined by one or more 

25 assays for measuring biological activity. 

The interaction of test molecules with C3b/C4b CR- 
like polypeptides may also be assayed directly using 
polyclonal or monoclonal antibodies iri an immunoassay. 
Alternatively, modified forms of C3b/C4b CR-like 
30 polypeptides containing epitope tags as described 
herein may be used in immunoassays. 
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f In the event that C3b/C4b CR-like polypeptides 

display biological activity through an interaction with 
a binding partner (e.g., a receptor . or a ligand) , a 
•variety of in vitro assays may be used to measure the 
-5 binding of a C3b/C4b CR-like polypeptide to the 
corresponding binding partner (such as a selective 
binding agent, receptor, or ligand) . These assays may 
be used to screen test molecules for their ability to 
increase or decrease the rate and/or the extent of 
10 binding of a C3b/C4b CR-like polypeptide to its binding 
partner. In one assay, a C3b/C4b CR-like polypeptide 
is immobilized in the wells of a microtiter plate. 
Radiolabeled C3b/C4b CR-like binding partner (for 
example, iodinated C3b/C4b CR-like binding partner) and 
15 the test molecule (s) can then be added either one at a 
time (in either order) or simultaneously to the wells. 
After incubation, the wells can be washed and counted, 
using a scintillation counter, for radioactivity to 
determine the extent to which the binding partner bound 
20 to C3b/C4b CR-like polypeptide. Typically, the 

molecules will be tested over a range of 
concentrations, and a series of control wells lacking 
one or more elements of the test assays can be used for 
accuracy in the evaluation of the results. An 
25 alternative to this method involves reversing the 
"positions" of the proteins, i.e., immobilizing C3b/C4b 
CR-like binding partner to the microtiter plate wells, 
incubating with the test molecule and radiolabeled 
C3b/C4b CR-like polypeptide, and determining the extent 
30 of C3b/C4b CR-like polypeptide binding. See, for 
example, chapter 18, Current Protocols in Molecular 
Biology, Ausubel et al . , eds . , John Wiley & Sons, New 
York, NY (1995) . 
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As an alternative to radiolabeliing, a C3b/C4b CR- 
like polypeptide or its binding partner may be 
conjugated to biotin and the presence of biotinylated 
protein can then be detected using streptavidin linked 
5 to an enzyme, such as horseradish peroxidase (HRP) or 
alkaline phosphatase (AP) , that can be detected 
colorometrically , or • by fluorescent tagging of 
streptavidin. An antibody directed to a C3b/C4b CR- 
like polypeptide or to a C3b/C4b CR-like binding 
10 partner and conjugated to biotin may also be used and 
can be detected after incubation with enzyme -linked 
streptavidin linked to AP or HRP. 

» 

An C3b/C4b CR-like polypeptide or a C3b/C4b CR- 
like binding partner can also be immobilized by 

15 attachment to agarose beads, acrylic beads or other 
types of such inert solid phase substrates. The 
substrate-protein complex can be placed in a solution 
containing the complementary protein and the test 
compound. After incubation, the. beads can be 

2 0 precipitated by centrif ugation, and the amount of 
binding between a C3b/C4b CR-like polypeptide and its 
binding partner can be assessed using the methods 
described herein. Alternatively, the substrate-protein 
complex can be immobilized in a column, and the test 

25 molecule and complementary protein are passed through 
the column. The formation of a complex between a 
C3b/C4b CR-like polypeptide and its binding partner can 
then be assessed using any of the techniques set forth 
herein, i.e., radiolabeliing, antibody binding, or the 

30 like. 

Another in vitro assay that is useful for 
identifying a test molecule which increases or 
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decreases the formation of a complex between a C3b/C4b 
Complement Receptor polypeptide and a C3b/C4b CR-like 
binding partner is a surface plasmon resonance detector 
system such as the BIAcore assay system (Pharmacia, 
5 Piscataway, NJ) The BIAcore system may be carried out 
using the manufacturer's protocol. This assay 

essentially involves the covalent binding of either 
C3b/C4b CR-like polypeptide or a C3b/C4b CR-like 
binding partner to a dext ran- coated sensor chip which 

10 is located in a detector. The test compound and the 
other complementary protein can then be injected, 
either simultaneously or sequentially, into the chamber 
containing the sensor chip. The amount of 

complementary protein that binds can be assessed based 

15 on the change in molecular mass which is physically 
associated with the dextran- coated side of the sensor 
chip; the change in molecular mass can be measured.- by 
the detector system. 

In some cases, it may be desirable to evaluate two 
2 0 or more test compounds together for their ability to 
increase or decrease the formation of a complex between 
a C3b/C4b CR-like polypeptide and a C3b/C4b CR-like 
binding partner. In these cases, the assays set forth 
herein can be readily modified by adding such 
25 additional test compound (s) either simultaneous with, 
or subsequent to, the first test compound. The 
remainder of the steps in the * assay are as set forth 
herein. 

In vitro assays such as those described herein may 
30 be used advantageously to screen large numbers of 
compounds for effects on complex formation by C3b/C4b 
CR-like polypeptide and C3b/C4b CR-like binding 
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; partner. The assays may be automated to screen 
compounds generated in phage display, synthetic 
peptide, and chemical synthesis libraries. 

Compounds which increase or decrease the formation 
5 of a complex between a C3b/C4b CR-like polypeptide and 
a C3b/C4b CR-like binding partner may also be screened 
in cell culture using cells and cell lines expressing 
either C3b/C4b CR-like polypeptide or C3b/C4b CR-like 
binding partner. Cells and cell lines may be obtained 
10 from any mammal, but preferably will be from human or 
other primate, canine, or rodent sources. The binding 
of a C3b/C4b CR-like polypeptide to cells expressing 
C3b/C4b CR-like binding partner at the surface is 
evaluated in the presence or absence of test molecules, 
15 and the extent of binding may be determined by, for 
example, flow cytometry using a biotinylated antibody 
to a C3b/C4b CR-like binding partner. Cell culture 
assays can be used advantageously to further evaluate 
compounds that score positive in protein binding assays 
20 described herein. 

Cell cultures can also be used to screen the 
* impact of a drug candidate. For example, drug 

candidates may decrease or increase the expression of 
the C3b/C4b CR-like gene. In certain embodiments, the 

25 amount of C3b/C4b CR-like polypeptide that is produced 
may be measured after exposure of the cell culture to 
the drug candidate. In certain embodiments, one may 
detect the actual impact of the drug candidate on the 
cell culture. For example, the overexpression of a 

3 0 particular gene may have a particular impact on the 
cell culture. In such cases, one may test a drug 
candidate's ability to increase or decrease the 
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^expression of the gene or its ability to prevent or 
inhibit a particular impact on the cell culture. In 
other examples, the production of a particular 

'metabolic product such as a fragment of a polypeptide, 
-5 may result in, or be associated with, a disease or 
pathological condition. In such cases, one may test a 
drug candidate's ability to decrease the production of 
such a metabolic product in a cell culture. 

A yeast two hybrid system (Chien et al., Proc. 

10 Natl. Acad. Sex. USA, 88^:9578-9583 (1991)) can be used 
to identify novel polypeptides that bind to, or 
interact with, C3b/C4b CR-like polypeptides. As an 
example, hybrid constructs comprising DNA encoding a 
cytoplasmic domain of a C3b/C4b CR-like polypeptide 

15 fused to a yeast GAL.4-DNA binding domain may be used as 
a two-hybrid bait plasmid. Positive clones emerging 
from the screening may be characterized further to 
identify interacting proteins. 

Internalizing Proteins 

20 The tat protein sequence (from HIV) can be used to 

internalize proteins into a cell. See e.g., Falwell et 
al., Proc. Natl. Acad. Sex., 9JL: 664-668 (1994). For 
example, an 11 amino acid sequence (YGRKKRRQRRR) of the 
HIV tat protein (termed the "protein transduction 

25 domain", or TAT PDT) has been described as mediating 
delivery across the cytoplasmic membrane and the 
nuclear membrane of a cell. See Schwarze et al., 
Science, 285 : 1569-1572 (1999); and Nagahara et al . , 
Nature Medicine, 4:1449-1452 (1998) . In these 

3 0 procedures, FITC- constructs (FITC-GGGG YGRKKRRQRRR) are 
prepared which bind to cells as observed by 
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fluorescence-activated cell sorting (FACS) analysis, 
and these constructs penetrate tissues after i.p". 
administration. Next, tat-bgal fusion proteins are 
constructed. Cells treated with this construct 

5 demonstrated b-gal activity. Following injection, a 
number of tissues, including liver, kidney, lung, 
heart, and brain tissue have been found to demonstrate 
expression using these procedures. It is believed that 
these constructions underwent some degree of unfolding 
10 in order to enter the cell; as such, refolding may be 
required after entering the cell. 

It will thus be appreciated that the tat protein 
sequence may be used to internalize a desired protein 
or polypeptide into a cell. For example, using the tat 

15 protein sequence, a C3b/C4b CR-like antagonist (such. as ' 
an anti-C3b/C4b CR-like selective binding agent, small 
molecule, soluble receptor, or antisense 

oligonucleotide) can be administered intracellularly to 
inhibit th^ activity of a C3b/C4b CR-like molecule. As 

20 used herein, the term "C3b/C4b CR-like molecule" refers 
to both C3b/C4b CR-like nucleic acid molecules and 
C3b/C4b CR-like polypeptides as defined herein. Where 
desired, the C3b/C4b CR-like protein itself may also be 
internally administered to a cell using these 

25 procedures. See also, Strauss, E., "Introducing 

Proteins Into the Body's Cells", Science, 285 1!466-1467 
(1999) . 

Therapeutic Uses 

A non-exclusive list of acute and chronic diseases 
30 which can be treated, diagnosed, ameliorated, or 
prevented with the polypeptides and nucleic acids of 
the invention is set forth below. 
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C3b/C4b CR-related polypeptides may act to 
stimulate the activation of the complement system, 
which acts alone and in conjunction with antibodies to 
destroy cells that are foreign to the host and is a 
5 main defense against bacterial and viral infections. 
The ability of a binding partner to bind to and 
activate C3b/C4b CR-related polypeptide or protein may 
lead to complement activation. Such a binding partner 
can be an agonist of C3b/C4b-CR related polypeptide or 

10 protein, such as antibody, peptibody, peptide, 
carbohydxate, polynucleotide, or small, molecular weight 
organic molecule- Agonists of C3b/C4b CR-related 
polypeptides or proteins may be used to prevent and 
treat conditions characterized by insufficient or 

15 defective complement activation, such as bacterial and 
viral infections. 

. Alternatively, it may be desix^able to use an 
antagonist of C3b/C4b CR-related polypeptide or protein 
to block complement activation. An antagonist would be 

20 useful for preventing and . treating conditions 
characterized by excessive complement activation, 
particularly -immune system disorders such as rheumatoid 
arthritis, psioriatic arthritis, inflammatory 

arthritis, osteoarthritis, inflammatory joint disease, 

25 autoimmune disease, multiple sclerosis, lupus, 
diabetes, inflammatory bowel disease, transplant 
rejection, and graft versus host disease. Antagonists 
would also be useful for prevent or treating undesired 
complement -mediated damage to cells and tissues. In 

30 one embodiment, an antagonist comprises a soluble 
domain of a C3b/C4b CR-related polypeptide or protein. 

Other uses for agonists and antagonists of C3b/C4b 
CR-like molecules include the diagnosis, prevention and 
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* treatment of nervous system disorders, such as stroke, 
Alzheimer's disease, brain injury, and Parkinson's 
disease; damaged tissues, such as by wounds and burns; 

•ischemic conditions, such as atherosclerosis, 
~5 restenosis, myocardial infarction, angioplasty, 
hypertension, and ischemia; metabolic disorders, such 
as obesity, diabetes, and cachexia; and reproductive 
disorders, infertility, miscarriage, preterm labor and 
delivery, and endometriosis. 

10 

C3b/C4b CR-like Compositions and Administration 

Therapeutic compositions are within the scope of 
the present invention. Such C3B/C4B CR-like 

pharmaceutical compositions may comprise a 

15 therapeutically effective amount of a C3b/C4b CR-like 
polypeptide or a C3b/C4b CR-like nucleic acid molecule 
in admixture with a pharmaceutical! y or physiologically 
acceptable . formulation agent selected for suitability 
with the mode of administration. Pharmaceutical 

2 0 compositions may comprise a therapeutically effective 
amount of one or more C3b/C4b CR-like selective binding 
agents in admixture with a pharmaceutically or 
physiologically acceptable formulation agent selected 
for suitability with the mode of administration. 

2 5 Acceptable formulation materials preferably are 

nontoxic to recipients at the dosages and 
concentrations employed. 

The pharmaceutical composition may contain 
formulation materials for modifying, maintaining or 
30 preserving, for example, the pH, osmolarity, viscosity, 
clarity, color, isotonicity, odor, sterility. 
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stability, rate of dissolution or release, adsorption 
or penetration of the composition. Suitable 
formulation materials include, but are not limited to, 
amino acids (such as glycine, glutamine, asparagine, 
5 arginine or lysine) , antimicrobials, antioxidants (such 
as ascorbic acid, sodium sulfite or sodium hydrogen- 
sulfite) , buffers (such as borate, bicarbonate, Tris- 
HC1, citrates, phosphates, other organic acids), 
bulking agents (such as mannitol or glycine) , chelating 

10 agents (such as ethylenediamine tetraacetic acid 
(EDT-A) ) , complexing agents (such as caffeine, 
polyvinylpyrrolidone, . beta-cyclo'dextrin or 

hydroxypropyl-beta-cyclodextrin), , fillers, 
monosaccharides, disaccharides, and other carbohydrates 

15 (such as glucose, mannose, or dextrins) , proteins (such 
as serum albumin, gelatin or immunoglobulins) , 
coloring, flavoring and diluting agents, emulsifying 
agents, hydrophilic polymers (such as 

polyvinylpyrrolidone) , low molecular weight 

20 polypeptides, salt-forming counterions (such as 
sodium) , preservatives (such as benzalkonium chloride, 
benzoic acid, salicylic acid, thimerosal, phenethyl 
alcohol, methylparaben, propylparaben, chlorhexidine , 
sorbic acid or hydrogen peroxide) , solvents (such as 

25 glycerin, propylene glycol or polyethylene glycol) , 
sugar alcohols (such as mannitol or sorbitol) , 
suspending agents, surfactants or wetting agents (such 
as pluronics, PEG, sorbitan esters, polysorbates such 
as polysorbate 20, polysorbate 80, triton, 

30 tromethamine, lecithin, cholesterol, tyloxapal) , 
stability enhancing agents (sucrose or sorbitol) , 
tonicity enhancing agents (such as alkali metal halides 
(preferably sodium or potassium chloride) , mannitol 
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'sorbitol) , delivery vehicles, diluents, excipients 
and/or pharmaceutical adjuvants. ( .Remington ' s 

Pharmaceutical Sciences, 18 th Edition, A.R. Gennaro, 

'ed., Mack Publishing Company [1990]). 

5 The optimal pharmaceutical composition will be 

UCLCJ.mJ.ncu i~ty one dj\j. j. J- cu jlxi uhc aj. u ucpc.Aivij.iiy La^OI-i, 

for example, the intended route of administration, 
delivery format, and desired dosage. See for example, 
Remington ' s Phairmaceu ti cal Sci ences , supra . Such 
10 compositions may influence the physical state, 
stability, rate of in vivo release, and rate of in vivo 
clearance of the C3b/C4b CR-like molecule. 

The primary vehicle f or carrier in a pharmaceutical 
composition may be either aqueous or non-aqueous in 

15 nature. For example, a suitable vehicle or carrier may 
be water for injection, physiological saline solution, 
or artificial cerebrospinal fluid, possibly 
supplemented with other materials common in 
compositions for parenteral administration. Neutral 

20 buffered saline or saline mixed with serum albumin are 
further exemplary vehicles. Other exemplary 

pharmaceutical compositions comprise Tris buffer of 
about pH 7.0-8.5, or acetate buffer of about pH 4.0- 
5.5, which may further include sorbitol or a suitable 

2 5 substitute therefor. In one embodiment of the present 

invention, C3b/C4b CR-like polypeptide compositions may 
be prepared for storage by. mixing the selected 
composition having the desired degree of purity with 
optional formulation agents (Remington 9 s Pharmaceutical 

3 0 Sciences, supra) in the form of a lyophilized cake or 

an aqueous solution. Further, the C3b/C4b CR-like 
polypeptide product may be formulated as a lyophilizate 
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*using appropriate excipients such as sucrose. 

The C3b/C4b CR-like pharmaceutical compositions 
can be selected for parenteral delivery. 
Alternatively, the compositions may be selected for 
.5 inhalation or for delivery through the digestive tract, 
such as orally. The preparation of such 

pharmaceutical ly acceptable compositions is within the 
skill of the art. 

The formulation components are present in 
10 concentrations that are acceptable to the site of 
administration. For example, buffers are used to 
maintain the composition at physiological pH or at 
slightly lower pH # typically within a pH range of from 
about 5 to about 8 . 

L5 When parenteral administration is contemplated, 

the therapeutic compositions for use in this invention 
may -be in the form of a pyrogen- free, parenterally 
acceptable aqueous solution comprising the desired 
C3b/C4b CR-like . molecule in a pharmaceutically 

20 acceptable vehicle. A particularly suitable vehicle 
for parenteral injection is sterile distilled water in 
which a C3b/C4b CR-like molecule is formulated as a 
sterile, isotonic solution, properly preserved. Yet 
another preparation can involve the formulation of the 

25 desired molecule with an agent, such as injectable 
microspheres, bio-erodible particles, polymeric 
compounds (polylactic acid, polyglycolic acid) , or 
beads, or liposomes, that provides for the controlled 
or sustained release of the product which may then be 

3 0 delivered as a depot injection. Hyaluronic acid may 
also be used, and this may have the effect of promoting 
sustained duration in the circulation. Other suitable 
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means for the introduction of the desired molecule 
include implantable drug delivery devices. 

In one embodiment, a pharmaceutical composition 
may be formulated for inhalation. For example, a 
5 C3b/C4b CR-like molecule may be formulated as a dry 

£r w • * «*- — - — • — — . >*.w&* • / *— — — ■ ■ — — - - — xr ^ — ' ~ J — ir -~ — — — — — 

C3b/C4b CR-like nucleic acid molecule inhalation 
solutions may also be formulated with a propellant for 
aerosol delivery. In yet another embodiment, solutions 
10 may • be nebulized. Pulmonary administration is further 
described in PCT application no. PCT/US94/001875 , which 
describes pulmonary delivery of chemically modified 
proteins . 

It is also contemplated that certain formulations 
15 may be administered orally. In one. embodiment of the 
present invention,- C3b/C4b CR-like molecules which are 
administered in this fashion can be * formulated with or 
without those carriers customarily used in the 
compounding of solid dosage forms such as tablets and 
20 capsules. For example, a capsule may be designed to 
release the active portion of the formulation at the 
point in the gastrointestinal tract when 
bioavailability is maximized and pre-systemic 
degradation is minimized. Additional agents can be 
25 included to facilitate absorption of the C3b/C4b CR- 
like molecule. Diluents; flavorings, low melting point 
waxes, vegetable oils, lubricants, suspending agents, 
tablet disintegrating agents, and binders may also be 
employed. 

3 0 Another pharmaceutical composition may involve an 

effective quantity of C3b/C4b CR-like molecules in a 
mixture with non-toxic excipients which are suitable 
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for the manufacture of tablets. By dissolving the 
tablets in sterile water, or other appropriate vehicle, 
solutions can be prepared in unit dose form. Suitable 
excipients include, but are not limited to, inert 
5 diluents, such as calcium carbonate, sodium carbonate 
or bicarbonate, lactose, or calcium phosphate; or 
binding agents, such as starch, gelatin, or acacia; or 
lubricating agents such as magnesium stearate, stearic 
acid, or talc. 

10 Additional C3b/C4b CR-like pharmaceutical 

compositions will be evident to those skilled in the 
art, including formulations involving C3b/C4b CR-like 
polypeptides in sustained- or controlled-delivery 
formulations. Techniques for formulating a variety of 

15. other sustained- or controlled-delivery means, such as 
liposome carriers, bio-erodible micropart icles or 
porous bea : ds and depot injections, are also known to 
those skilled in the art. See for example, 

PCT/US93/00S29 which describes controlled, release of 

20 porous polymeric microparticles for the delivery of 
pharmaceutical compositions. Additional examples of 
sustained- release preparations include semipermeable 
polymer matrices in the form of shaped articles, e.gr. 
films, or microcapsules. Sustained release matrices 

25 may include polyesters, hydrogels, polylactides (U.S. 
3,773,919, EP 58,481), copolymers of L-glutamic acid 
and gamma ethyl -L-glutamate (Sidman et al . , 
Biopolymers, 22:547-556 (1983)), poly (2-hydroxyethyl - 
methacrylate) (Langer et al . , J". Biomed. Mater. Res. , 

30 ]L5:167-277 (1981) and Langer, Chem. Tech., 12:98-105 
(1982)), ethylene vinyl acetate (Langer et al . , supra) 
or poly-D(-) -3 -hydroxy butyric acid (EP 133,988). 
Sustained-release compositions also may include 
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liposomes, which can be prepared by any of several 
methods known in the art. See e.g., Eppstein et al . , 
Proc, Natl. Acad. Sex. USA, 82:3688-3692 (1985); EP 

•36,676; EP 88 f 046; EP 143,949. 

5 The C3b/C4b CR-like pharmaceutical composition to 

be ussd for d. zi v_i \ r cz administration typically must be 
sterile. This may be accomplished by filtration 
through sterile filtration membranes. Where the 

composition is lyophilized, sterilization using these 

10 methods may be conducted either prior to, or following, 
lyophilization and reconstitution. The composition for 
parenteral administration may be stored in lyophilized 
form or in solution. In addition, parenteral 

compositions generally are placed into a container 

15 having a sterile access port, for example, an 
intravenous solution bag or vial having a stopper 
pierceable by a hypodermic injection needle. 

Once the pharmaceutical composition has been 
formulated, it may be stored in sterile vials as a 
20 solution, suspension, gel, emulsion, solid, or a 
dehydrated or lyophilized powder. Such formulations 
may be stored either in a ready-to-use form or in a 
form (e.g., lyophilized) requiring reconstitution prior 
to administration. 

25 In a specific embodiment, the present invention is 

directed to kits for producing a single-dose 
administration unit. The kits may each contain both a 
first container having a dried protein and a second 
container having an aqueous formulation. Also included 

3 0 within the scope of this invention are kits containing 
single and mult i- chambered pre-filled syringes (e.g., 
liquid syringes and lyosyringes) . 
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An effective amount of a C3b/C4b CR-like 
pharmaceutical composition to be employed 

therapeutically will depend, for example, upon the 
'therapeutic context and objectives. One skilled in the 
•5 art will appreciate that the appropriate dosage levels 
for treatment will thus vary depending, in part, upon 
the molecule delivered, the indication for which the 
C3b/C4b CR-like molecule is being used, the route of 
administration, and the size (body weight, body surface 
10 or organ size) and condition (the age and general 
health) of the patient. Accordingly, the clinician may 
titer the dosage and modify the route of administration 
to obtain the optimal therapeutic effect. A typical 
dosage may range from about 0.1 ^ig/kg to up to about 
15 100 mg/kg or more, depending on the factors mentioned 
above. In other embodiments, the dosage may range from 
0.1 |ig/kg up to about 100 mg/kg; or 1 j^g/kg up to about 
100 mg/kg; or 5 jig/kg up to about 10 0 mg/kg. 

: The frequency of dosing will depend upon the 
20 pharmacokinetic parameters of the • C3b/C4b CR-like 
molecule in the formulation used. Typically, a 

clinician will administer the composition* until a 
dosage is reached that achieves the desired effect. 
The composition may therefore be administered as a 

2 5 single dose, or as two or more doses (which may or may 

not contain the same amount of the desired molecule) 
over time,, or as a continuous infusion via implantation 
device or catheter. Further refinement of the 

appropriate dosage is routinely made by those of 

3 0 ordinary skill in the art and is within the ambit of 

tasks routinely performed by them. Appropriate dosages 
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may be ascertained through use of appropriate dose- 
response data. 

The route of administration of the pharmaceutical 
composition is in accord with known methods, e.g. oral # 
5 injection by intravenous, intraperitoneal, 

intracerebral ( intra-parenchymai ) , 

intracerebroventricular , intramuscular, intra -ocular , 
intraarterial, intraportal, or intralesional routes, or 
by sustained release systems or implantation device. 
10 Where desired, the compositions may be administered by 
bolus injection or continuously by infusion, or by 
implantation device. 

Alter-natively or additionally, the composition may . \: 
be administered locally via implantation of a membrane, 

15 sponge, or other appropriate material on to which the 
desired molecule has been absorbed or encapsulated. 
Where an implantation device is used, the device may be 
implanted into any suitable tissue or organ, and 
delivery of the desired molecule may be via diffusion, 

20 timed release bolus, or continuous administration. 

In some cases, it may be desirable to use C3b/C4b 
CR-like pharmaceutical compositions in an ex vivo 
manner. In such instances, cells, tissues, or organs 
that have been removed from the patient are exposed to 
25 C3b/C4b CR-like pharmaceutical compositions after which 
the cells, tissues and/ or organs are subsequently 
implanted back into the patient. 

In other cases, a C3b/C4b CR-like polypeptide can 
be delivered by implanting certain cells that have been 
30 genetically engineered, using methods such as those 
described herein, to express and secrete the 
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polypeptide. Such cells may be animal or human cells, 
and may be autologous, heterologous, or xenogeneic. 
Optionally, the cells may be immortalized. In order to 
decrease the chance of an immunological response, the 
5 cells may be encapsulated to avoid infiltration of 
surrounding tissues. The encapsulation materials are 
typically biocompatible, semi -permeable polymeric 
enclosures or membranes that allow the release of the 
protein product (s) but prevent the destruction of the 
10 cells by the patient's immune system or by other 
detrimental factors from the surrounding tissues. 

Additional embodiments of the present invention 
relate to cells and methods (e.g., homologous 

15 recombination and/or other recombinant production 
methods) for both the in. vitro production of 
therapeutic polypeptides and for the production and 
delivery of therapeutic polypeptides by gene therapy or 
cell therapy. Homologous and other recombination 

2 0 methods may be used to modify a cell that contains a 
normally transcriptionally silent C3b/C4b CR-like gene, 
or an under expressed gene, and thereby produce a cell 
which expresses therapeutically efficacious amounts of 
C3b/C4b CR-like polypeptides. 

25 Homologous recombination is a technique originally 

developed for targeting genes to induce or correct 
mutations in transcriptionally active genes 
(Kucherlapati, Prog, in Nucl . Acid Res. & Mol. Biol., 
36:301, 1989). The basic technique was developed as a 

30 method for introducing specific mutations into specific 
regions of the mammalian genome (Thomas et al., Cell, 
4ji: 419-428, 1986; Thomas and Capecchi , Cell, 51:503- 
512, 1987; Doetschman et al . , Proc . Natl. Acad. Sci . , 
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" ; 85> :8583-8587, 1988) or to correct specific mutations 
within defective genes (Doetschman et al . , Nature, 
330 :576-578, 1987) . Exemplary homologous recombination 
•techniques are described in U.S. Patent No. 5,272,071 
•5 (EP 9193051, EP Publication No. 505500; PCT/US90/07642 , 
International Publication No. WO 91/09955) . 

Through homologous recombination, the DNA sequence 
to be inserted into the genome can be directed tc a 
specific region of the gene of interest by attaching it 
10 to targeting DNA. The targeting DNA is a .nucleotide 
sequence that is complementary (homologous) to a region 
of the genomic DNA. Small pieces of targeting DNA that 
are complementary to a specific region of the genome 
are put in contact with the parental strand during the 
15 DNA replication process. It is a general property of 
DNA that has been inserted into a cell to hybridize, 
and therefore, recombine with other pieces of 
endogenous DNA through shared homologous regions. If 
this complementary strand is attached to an 
20 oligonucleotide that contains a mutation or a different 
sequence or an additional nucleotide, it too is 
incorporated into the newly synthesized strand as a 
result of the recombination. As a result of the 
proofreading function, it is possible for the new 
25 sequence of DNA to serve as the template. Thus, the 
transferred DNA is incorporated into the genome. 

Attached to these pieces of targeting DNA are 
regions of DNA which may interact with or control the 
expression of a C3b/C4b CR-like polypeptide, e.g., 
30 flanking sequences. For example, a promoter/enhancer 
element, a suppresser, or an exogenous transcription 
modulatory element is inserted in the genome of the 
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'intended host cell in proximity and orientation 
sufficient to influence the transcription of DNA 
encoding the desired C3b/C4b CR-like polypeptide. The 

'control element controls a portion of the DNA present 
•5 in the host cell genome. Thus', the expression of the 
desired C3b/C4b CR-like polypeptide may be achieved not 
by transfection of DNA that encodes the C3b/C4b CR-like 
gene itself, but rather by the use of targeting DNA 
(containing regions of homology with the endogenous 
10 gene of interest) coupled with DNA regulatory segments 
that provide the endogenous gene sequence with 
recognizable signals for transcription of a C3b/C4b CR- 
like polypeptide. 

In an exemplary method, the expression of- a 
15 desired targeted gene in a cell (i.e., a desired 
endogenous cellular gene) is altered via homologous 
recombination into the cellular genome at a preselected 
site, by the introduction of . DNA which includes., at 
least a regulatory sequence, an- exon and a splice donor 
20 site. These components are introduced into the 

chromosomal (genomic) DNA in such a manner that this, 
in effect, results in the production of a new 
transcription unit (in which the regulatory sequence, 
the exon and the splice donor site present in the DNA 

2 5 construct are operatively linked to the endogenous 

gene) . As a result of the introduction of these 
components into the chromosomal DNA, the expression of 
the desired endogenous gene is altered. 

Altered gene expression, as described herein, 

3 0 encompasses activating (or causing to be expressed) a 

gene which is normally silent (unexpressed) in the cell 
as obtained, as well as increasing the expression of a 
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gene which is not expressed at physiologically 
significant levels in the cell as obtained. The 
embodiments further encompass changing the pattern of 
regulation or induction such that it is different from 
5 the pattern of regulation or induction that occurs in 
the cell as obtained, and reducing (including 
eliminating) the expression of a. gene which is 
expressed in the cell as obtained. 

One method by which homologous recombination can 

10 be used to increase,, or cause, C3b/C4b CR-like 
polypeptide production from a cell's endogenous C3b/C4b 
CR-like gene involves first using homologous 
recombination to place a recombination sequence front a 
site-specific recombination system (e.g., Cre/loxP, 

15 FLP/FRT) (Sauer, Current Opinion In Biotechnology, 
5:521-527, 1994; Sauer, Methods In Enzymology, 225 : 8-90- 
900, 1993) upstream (that is, 5' to) of the cell's 
endogenous genomic C3b/C4b CR-like polypeptide* coding 
region. A plasmid containing a recombination site 

20 homologous, to the site that was placed just upstream' of 
the genomic C3b/C4b CR-like polypeptide coding region 
is introduced into the modified cell line along with 
the appropriate recombinase enzyme. This recombinase 
causes the plasmid to integrate, via the plasmid' s 

25 recombination site, into the recombination site located 
just upstream of the genomic C3b/C4b CR-like 
polypeptide coding region in the cell line (Baubonis 
and Sauer, Nucleic Acids Res., 21:2025-2029, 1993; 
O'Gorman et al . , Science, 251 :1351-1355, 1991). Any 

3 0 flanking sequences known to increase transcription 
(e.g., enhancer /promoter , intron, translational 
enhancer) , if properly positioned in this plasmid, 
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would integrate in such a manner as to create a new or 
modified transcriptional unit resulting in de novo or 
increased C3b/C4b CR-like polypeptide production from 
the cell's endogenous C3b/C4b CR-like gene. 

5 A further method to use the cell line in which the 

site specific recombination sequence had been placed 
just upstream of the cell's endogenous genomic C3b/C4b 
CR-like polypeptide coding region is to use homologous 
recombination to introduce a second recombination site 

10 elsewhere in the cell line's genome. The appropriate 
recombinase enzyme is then introduced into the two- 
recombination- site cell line, causing a recombination 
event (deletion, inversion, translocation) (Sauer, 
Current Opinion In Biotechnology, supra, 1994; Sauer, 

15 Methods In Enzymology, sup:ca, 1993) that would create a 
new. . or modified transcriptional unit resulting in de 
novo or increased C3b/C4b' CR-like polypeptide 
production from the cell's endogenous C3b/C4b CR-like 
gene . 

20 An additional approach for increasing, or causing, 

the expression of C3b/C4b CR-like polypeptide from a 
cell's endogenous C3b/C4b CR-like gene involves 
increasing, or causing, . the expression of a gene or 
genes (e.g., transcription factors) and/or decreasing 
25 the expression of a gene or genes (e.g., 
transcriptional repressors) in a manner which results 
in de novo or increased C3b/C4b CR-like polypeptide 
production from the cell's endogenous C3b/C4b CR-like 
gene. This method includes the introduction of a non- 
30 naturally occurring polypeptide (e.g., a polypeptide 
comprising a site specific DNA binding domain fused to 
a transcriptional factor domain) into the cell such 
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" > that de novo or increased C3b/C4b CR-like polypeptide 
production from the cell's endogenous C3b/C4b CR-like 
gene results. 

The present invention further relates to DNA 
5 constructs useful in the method of altering expression 
of a target gene. In certain embodiments, the 

exemplary DNA constructs comprise: (a) one or more 
targeting sequences; (b) a regulatory sequence; (c) an 
exon; and (d) an unpaired splice-donor site. The 

10 targeting sequence in the DNA construct directs ! the 
integration of elements (a) -(d)' into a target gene in a 
cell such that the elements (b) - (d) are operatively 
linked to sequences of the endogenous target gene. In 
another embodiment, the DNA constructs comprise: (a) 

15 one or more targeting, sequences, (b) a regulatory 
sequence, (c) an exon, (d) a . splice -donor site, (e) an 
intron, and (f) a splice-acceptor site, wherein the 
targeting sequence directs the integration of elements 
(a) - (f ) such that the elements of (b) - (f ) are 

2 0 operatively linked to the endogenous gene. The 
targeting sequence is homologous to the preselected 
site in the cellular chromosomal DNA with which 
homologous' recombination is to occur. In the 

construct, the exon is generally 3' of the regulatory 

25 sequence and the splice-donor site is 3' of the exon. 

If the sequence of a particular gene is known, 
such as the nucleic acid sequence of C3b/C4b CR-like 
polypeptide presented herein, a piece of DNA that is 
complementary to a selected region of the gene can be 
30 synthesized or otherwise obtained, such as by 
appropriate restriction of the native DNA at specific 
recognition sites bounding the region of interest. 
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J This piece serves as a targeting sequence (s) upon 
insertion into the cell and will hybridize to its 
homologous region within the genome. If this 

'hybridization occurs during DNA replication, this piece 
k 5 of DNA, and any additional sequence attached thereto, 
will act as an Okazaki fragment and will be 
incorporated into the newly synthesized daughter strand 
of DNA . The present invention, therefore, includes 
nucleotides encoding a C3b/C4b CR-like polypeptide, 
10 which nucleotides may be used as targeting sequences. 

. C3b/C4b CR-like polypeptide cell therapy, e.g., 
the implantation of cells producing C3b/C4b CR-like 
polypeptides, is also contemplated. This embodiment 
involves implanting cells capable of synthesizing and 

15 secreting a biologically active form of C3b/C4b CR-like 
polypeptide. Such C3b/C4b CR-like polypeptide- 

. producing cells can be cells that are natural producers 
of C3b/C4b CR-like polypeptides or may be recombinant 
cells whose ability to produce C3b/C4b CR-like 

20 polypeptides has been augmented by transformation with 
a gene encoding the desired C3b/C4b CR-like polypeptide 
or with a gene augmenting the expression of C3b/C4b CR- 
like polypeptide. Such a modification may be 
accomplished by means of a vector suitable for 

25 delivering the gene as well as promoting its expression 
and secretion. In order to minimize a potential 
immunological reaction in patients being administered a 
C3b/C4b CR-like polypeptide, as may occur with the 
administration of a polypeptide of a foreign species, 

30 it is preferred that the natural cells producing 
C3b/C4b CR-like polypeptide be of human origin and 
produce human C3b/C4b CR-like polypeptide. Likewise, 
it is preferred that the recombinant cells producing 
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C3b/C4b CR-like polypeptide be transformed with an 
expression vector containing a gene encoding a human 
C3b/C4b CR-like polypeptide. 

Implanted cells may be encapsulated to avoid the 
5 infiltration of surrounding tissue. Human or non-human 
animal cells may be implanted in patients in 
biocompatible, semipermeable polymeric enclosures or 
membranes that allow the release of C3b/C4b CR-like 
polypeptide, but that prevent the destruction of the* 

10 cells by the patient's immune system or by other 
detrimental factors from the surrounding tissue. 
Alternatively, the patient's own cells, transformed to 
produce C3b/C4b CR-like polypeptides ex vivo, may be 
implanted - directly into the patient without such 

15' encapsulation. 

Techniques for the encapsulation of living cells 
are known in the art, and the preparation of the 
encapsulated cells and their implantation in patients 
may be routinely accomplished. For example, Baetge et 

20 al. (WO95/05452; PCT/US94/09299) describe membrane 
capsules containing genetically engineered cells for 
the effective delivery of biologically active 
molecules. The capsules are biocompatible and are 
easily retrievable. The capsules encapsulate cells 

25 transfected with recombinant DNA molecules comprising 
DNA sequences coding for biologically active molecules 
operatively linked to promoters that are not subject to 
down regulation in vivo upon implantation into a 
mammalian host. The devices provide for the delivery 

30 of the molecules . from living cells to specific sites 
within a recipient. In addition, see U.S. Patent Nos . 
4,892,538, 5,011,472, and 5,106,627. A system for 
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J encapsulating living cells is described in PCT 
Application no. PCT/US91/00157 of Aebischer et a J . See 
also, PCT Application no. PCT/US91/00155 of Aebischer 

' et a!., Winn et al . , Exper. Neurol., 113 : 322-329 
'5 (1991), Aebischer et al . , Exper. Neurol., Ill :269-275 
(1991); and Tresco et al . , ASAIO, 38:17-23 (1992). 

Jn vivo and in vitro gene therapy delivery of 
C3b/C4b CR-like polypeptides is also envisioned. One 
example of a gene therapy technique is to use the 

10 C3b/C4b CR-like gene (either genomic DNA, cDNA, and/or 
synthetic DNA) encoding a C3b/C4b CR-like polypeptide 
which may be operably linked to a constitutive or 
inducible promoter to form a "gene therapy DNA 
construct" . The promoter may be homologous or 

15 heterologous to the endogenous C3b/C4b CR-like gene, 
provided that it is active in the cell or tissue type 
into which the construct will be inserted; Other 
components of the gene therapy DNA construct may 
optionally include, DNA molecules designed for site- 

20 specific integration (e.g., endogenous sequences useful 
for homologous recombination) , tissue-specific 
promoter, enhancer(s) or silencer(s) , DNA molecules 
capable of providing a selective advantage over the 
parent cell, DNA molecules useful as labels to identify 

25 transformed cells, negative selection systems, cell 
specific binding agents (as, for example, for cell 
targeting) , cell-specific internalization factors, and 
transcription factors to enhance expression by a vector 
as well as factors to enable vector manufacture. 

30 A gene therapy DNA construct can then be 

introduced into cells (either ex vivo or in vivo) using 
viral or non-viral vectors. One means for introducing 
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the gene therapy DNA construct is by means of viral 
vectors as described herein. Certain vectors, such as 
retroviral vectors, will deliver the DNA construct to 
the chromosomal DNA of the cells, and the gene can 
5 integrate into the chromosomal DNA. Other vectors will 
function as epi somes, and the gene therapy DNA 
construct will remain in the cytoplasm. 

In yet other embodiments; regulatory elements can 
be included for. the controlled expression of the 

10 C3b/C4b CR-like gene in the target cell. Such elements 
are turned on in response to an appropriate effector. 
In this way, a therapeutic polypeptide can be expressed 
when desired. One conventional control means involves 
the use of small molecule dimerizers or rapalogs (as 

15 described in W09641865 (PCT/US96/099486) ; W09731898 
(PCT/US97/03137) and W09731899. (PCT/US95/03 157) used to 
dimerize chimeric proteins which contain a small 
molecule -binding domain and a domain capable of 
initiating biological process, such as a DNA-binding 

20 protein or transcriptional activation protein. The 
dimerization of the proteins can be used to initiate 
transcription of the transgene. 

An alternative regulation technology uses a method 
of storing proteins expressed from the gene of interest 

25 inside the cell as an aggregate or cluster. The gene 
of interest is expressed as a fusion protein that 
includes a conditional aggregation domain which results 
in the retention of the aggregated protein in the 
endoplasmic reticulum. . The stored proteins are stable 

30 and inactive inside the cell. The proteins can be 
released, however, by administering a drug (e.g., small 
molecule ligand) that removes the conditional 
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aggregation domain and thereby specifically breaks 
apart the aggregates or clusters so that the proteins 
may be secreted from the cell. See, Science 287 : 816- 
817, and 826-830 (2000) . 

5 Other suitable control means or gene switches 

include, but are not limited to, the following systems. 
Mifepristone (RU486) is used as a progesterone 
antagonist. The binding of a modified progesterone 
receptor ligand-binding domain to the progesterone 
10 antagonist activates transcription by forming a dimer 
of two transcription factors which then pass into the 
nucleus to bind DNA. The ligand binding domain is 
modified to eliminate the ability of the receptor to 
bind to the natural ligand. The modified steroid . 
15 hormone receptor system is further described in U.S. . 
5,364,791; WO9640911, and WO9710337. 

Yet another control system uses ecdysone (a. fruit 
fly steroid hormone) which binds to and activates an : 
ecdysone receptor (cytoplasmic receptor) . The receptor .-; 
then translocates to the nucleus to bind a specific DNA 
response element (promoter from ecdysone -responsive 
gene) . The ecdysone receptor includes a 

transact ivat ion domain/DNA-binding domain/1 igand- 
binding domain to initiate transcription. The ecdysone 
system is -further described in U.S. 5,514,578; 
W09738117; WO9637609; and WO9303162. 

Another control means uses a positive 
tetracycline-controllable transactivator . This system 
involves a mutated tet repressor protein DNA-binding 
30 domain (mutated tet . R-4 amino acid changes which 
resulted in a reverse tetracycline-regulated 
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* transact ivator protein, i.e., it binds to a tet 
operator in the presence of tetracycline) linked to a 
polypeptide which activates transcription. Such 
-systems are described in U.S. Patent Nos. 5,464,758; 
.5 5,650,298 and 5,654,168. 

Additional expression control systems and nucleic 
acid constructs are described in U.S. Patent Nos. 
5,741,679 and 5,834,186, to Innovir Laboratories Inc. 

In vivo gene therapy may be accomplished by 
10 introducing the gene encoding a C3b/C4b CR-like 
polypeptide into cells via local injection of a C3b/C4b 
CR-like nucleic acid molecule or by. other appropriate 
viral or non-viral delivery vectors. Hefti, 
Neurobiology, 25.: 1418-1435 (1994) . . For example, a 
15 nucleic acid molecule encoding a C3b/C4b CR-like 
polypeptide may be contained in an adeno-associated 
virus (AAV) vector for delivery to the targeted cells 
(e.g., Johnson, International Publication. No. 
WO95/34670; International Application No. 

20 PCT/US95/07178) . The recombinant AAV genome typically 
coritains AAV inverted terminal repeats flanking a DNA 
sequence encoding a C3b/C4b CR-like polypeptide 
operably linked to functional promoter and 
polyadenylation sequences. 

25 Alternative suitable viral vectors include, but 

are not limited to, retrovirus, adenovirus, herpes 
simplex virus, lentivirus, hepatitis virus, parvovirus, 
papovavirus, poxvirus, alphavirus, coronavirus, 
rhabdovirus, paramyxovirus, and papilloma virus 

3 0 vectors. U.S. Patent No. 5,672,344 describes an in 
vivo viral -mediated gene transfer system involving a 
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5 recombinant neurotrophic HSV-1 vector. U.S. Patent No. 
5,399,346 provides examples of a process for providing 
a patient with a therapeutic protein by the delivery of 

'human cells which have been treated in vitro to insert 
-5 a DNA segment encoding a therapeutic protein. 
Additional methods and materials for the practice of 
gene therapy techniques are described in U.S. Patent , 
No. 5,631,236 involving adenoviral vectors; U,S. Patent 
No. 5,672,510 involving retroviral vectors; and U.S. 
10 5,635,3 99 involving retroviral vectors expressing 
cytokines. 

Nonviral delivery methods include, but are not 
J limited to, liposome -mediated transfer, naked DNA 

delivery (direct injection) , receptor-mediated transfer 

15 (ligand-DNA complex) , electroporation, ca3.cium 
phosphate precipitation, and microparticle bombardment 
(e.gr., gene gun).. Gene therapy materials and methods 
may also* include the use of inducible promoters, 
tissue-specific enhancer-promoters, DNA sequences 

20 designed for site- specif ic integration, DNA sequences 
capable of providing a selective advantage over the 
parent cell, labels to identify transformed cells, 
negative selection systems and expression control 
systems (safety measures) , cell-specific binding agents 

25 (for cell targeting) , cell-specific internalization 
factors, and transcription factors to enhance 
expression by a vector as well as methods of vector 
manufacture. Such additional methods and materials for 
the practice of gene therapy techniques are described 

30 in U.S. Patent No. 4,970,154 involving electropox^ation 
techniques; WO96/4 0958 involving nuclear ligands; U.S. 
Patent No. 5,679,559 describing a lipoprotein- 
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containing system for gene delivery; U.S. Patent No. 
5,676,954 involving liposome carriers; U.S. Patent No. 
5 , 593 , 875 concerning methods for calcium phosphate 
transf ection; and U.S. Patent No. 4,945,050 wherein 
5 biologically active particles are propelled at cells at 
a speed whereby the particles penetrate the surface of 
the cells and become incorporated into the interior of 
the cells. 

It is also contemplated that C3b/C4b CR-like gene 
10 therapy or cell therapy can further include the 
delivery of one or more additional polypeptide (s) in 
the same or a different cell (s) . Such cells may be 
separately introduced into the patient, or the cells 
may be contained in a single implantable device, such 
15 as the encapsulating membrane described above, or the 
cells may be separately modified by means of viral 
vectors . 

A means to increase endogenous C3b/C4b CR-like 
polypeptide expression in a cell via gene therapy is to 

20 insert one or more enhancer elements into the C3b/C4b 
CR-like polypeptide promoter, where the enhancer 
element (s) can serve to increase transcriptional 
activity of the C3b/C4b CR-like gene. The enhancer 
element (s) used will be selected based on the tissue in 

25 which one desires to activate the gene(s); enhancer 
elements known to confer promoter activation in that 
tissue will be selected. For example, if a gene 
encoding a C3b/C4b CR-like polypeptide is to be "turned 
on" in T-cells, the lck promoter enhancer element may 

30 be used. Here, the functional portion of the 

transcriptional element to be added may be inserted 
into a fragment of DNA containing the C3b/C4b CR-like 
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polypeptide promoter (and optionally, inserted into a 
vector and/or 5' and/or 3' flanking sequence (s) # etc.) 
using standard cloning techniques. This construct, 
known as a ^homologous recombination construct" , can 
5 then be. introduced into the desired cells either ex 
vivo or in vivo. 

Gene therapy also can be used to decrease C3b/C4b 
CR-like polypeptide expression by modifying the 
nucleotide sequence of the endogenous promoter (s) . 

10 Such modification is typically accomplished via 
homologous recombination methods. For example, a DNA 
molecule containing all or a portion of the promoter of 
the C3b/C4b CR-like gene(s) selected for inactivation 
can be engineered to remove and/or replace pieces of 

15 the promoter that regulate transcription. For example 
the TATA box and/or the binding site of a 
transcriptional activator of the promoter may be 
deleted using standard molecular biology techniques; 
such deletion can inhibit promoter • activity thereby 

20 repressing the transcription ' of the corresponding 
C3b/C4b CR-like gene. The deletion of the TATA box or 
the transcription activator binding site in the 
promoter may be accomplished by generating a DNA 
construct comprising all or the relevant portion of the 

25 C3b/C4b CR-like polypeptide promoter (s) (from the same 
or a related species as the C3b/C4b CR-like gene(s) to 
be regulated) in which- one or more of the TATA box 
and/or transcriptional activator binding site 
nucleotides are mutated via substitution, deletion 

30 and/or insertion of one or more nucleotides. As a 
result, the TATA box and/or activator binding site has 
decreased activity or is rendered completely inactive. 
The construct will typically contain at least about 500 



114 



WO 02/10199 



PCT/US01/23232 



J bases of DNA that correspond to the native (endogenous) 
5' and 3' DNA sequences adjacent to the promoter 
segment that has been modified. The construct may be 
, introduced into the appropriate cells (either ex vivo 
.5 or in vivo) either directly or via a viral vector as 
described herein. Typically, the integration of the 
construct into the genomic DNA of the cells will be via 
^homologous recombination, where the 5' and 3' DNA 
sequences in the promoter construct can serve to help 
10 integrate the modified promoter region via 
hybridization to the endogenous chromosomal DNA. 

Additional Uses of C3b/C4b CR-like Nucleic Acids and 
Polypeptides 

15 Nucleic acid molecules of the -.present invention 

(including those that do not themselves encode 
biologically active polypeptides) may be used to map 
the locations of the C3b/C4b CR-like gene and related 
genes on. chromosomes; Mapping may . be done by 

20 techniques known in the art/ such as.PCR amplification 
and -in situ hybridization. 

C3b/C4b CR-like nucleic acid mol.ecules (including 
those that do not themselves encode biologically active 
polypeptides) , may be useful as hybridization probes in 
25 diagnostic assays to test, either qualitatively or 
quantitatively, for the presence of a C3b/C4b CR-like 
DNA or corresponding RNA in mammalian tissue or bodily 
fluid samples. 

The C3b/C4b CR-like polypeptides may be used 
30 (simultaneously or sequentially) in combination with 
one or more cytokines, growth factors, antibiotics, 
anti- inflammatories, and/or chemotherapeutic agents as 
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is appropriate for the indication being treated. 

Other methods may also be employed where it is 
desirable to inhibit the activity of one or more 
C3b/C4b CR-like polypeptides. Such inhibition may be 
5 effected by nucleic acid molecules which are 
complementary to and hybridize to expression control 
sequences (triple helix formation) or to C3b/C4b CR- 
like mRNA. For example, ant i sense DNA or RNA 
molecules, which have a sequence that is complementary 

10 to at least a portion of the selected C3b/C4b CR-like 
gene(s) can be introduced into the cell. Anti-sense 
probes may be designed by available techniques using 
the sequence of C3b/C4b CR-like polypeptide disclosed 
herein. Typically, each such antisense molecule will 

15 be complementary to the start site (5' end) of each 
selected C3b/C4b CR-like gene. When the antisense 
molecule then hybridizes to the corresponding C3b/C4b 
CR-like mRNA, translation of this mRNA is prevented or 
a:educed. Anti-sense inhibitors provide information 

20 ' relating to the decrease or absence of a C3b/C4b CR- 
like polypeptide in a cell or organism. 

Alternatively, gene therapy may be employed to 
create a dominant -negative inhibitor of one or more 
C3b/C4b CR-like polypeptides. In this situation, the 

25 DNA encoding a mutant polypeptide of each selected 
C3b/C4b CR-like polypeptide can be prepared and 
introduced into the cells of a patient using either 
viral or non-viral methods as described herein. Each 
such mutant is typically designed to compete with 

30 endogenous polypeptide in its biological role. 

In addition, a C3b/C4b CR-like polypeptide, 
whether biologically active or not, may be used as an 
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J immunogen, that is, the polypeptide contains at least 
one epitope to which antibodies may be raised. 
Selective binding agents that bind to a C3b/C4b CR-like 
< polypeptide (as described herein) may be used for in 
• 5 vivo and in vitro diagnostic purposes, including, but 
not limited to, use in labeled form to detect the 
presence of C3b/C4b CR-like polypeptide in ei body fluid 
* or cell sample* The antibodies may also be used to 
prevent, treat, or diagnose a number of diseases and 
10 disorders, including those recited herein. The 
antibodies may bind to a C3b/C4b CR-like polypeptide so 
as to diminish or block at least one activity 
characteristic of a C3b/C4b CR-like polypeptide, or may 
bind to a polypeptide to increase at least one activity 
15 characteristic of a . C3b/.C4b CR-like polypeptide 
(including by increasing the pharmacokinetics of the 
C3b/C4b CR-like polypeptide) . 
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W HAT IS CLAIMED 

1. An isolated nucleic acid molecule comprising a 
nucleotide sequence selected from the grou£> consisting 

;;; of: 

5 (a) the nucleotide sequence as set forth in SEQ 

ID NO:l, SEQ ID NC:3, or SEQ ID NO : 6 ; 

(b) a nucleotide sequence encoding the 
polypeptide as set forth in SEQ ID NQ y :2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

10 (c) a nucleotide sequence which hybridizes under 

moderately or highly stringent conditions to the 
complement of (a) or (b) , wherein the encoded 
polypeptide has an activity of the polypeptide as set 
forth in SEQ ID NO: 2, SEQ ID NO:4, or SEQ ID NO: 7; and 

15 (d) a nucleotide sequence complementary to any of 

(a) -(c). 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence selected from the group consisting 

20 of: 

(a) a nucleotide sequence encoding a polypeptide 
that is at least about 70, 75, 80, 85, 90, 95, 96, 97, 
98, or 99 percent identical to the polypeptide as set 
forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7, 

25 wherein the polypeptide has an activity of the 
polypeptide as set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(b) a nucleotide sequence encoding an allelic 
variant or splice variant of the nucleotide sequence as 

30 set forth in SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6, 
wherein the encoded polypeptide has an activity of the 
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polypeptide as set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(c) a nucleotide sequence of SEQ ID NO:l, SEQ ID 
NO: 3, or SEQ ID NO: 6; (a); or (b) encoding a 

5 polypeptide fragment of at least about 25 amino acid 
residues j wherein the polypeptide has an activity of 
the polypeptide as set forth in SEQ ID NO:2, SEQ ID 
NO:4, or SEQ ID NO: 7; 

(d) a nucleotide sequence of SEQ ID NO:l, SEQ ID 
10 NO:3, or SEQ ID NO:6, or (a) - (c) comprising a fragment 

of at least about 16 nucleotides; 

(e) a nucleotide sequence which hybridizes under 
moderately or highly stringent conditions to the 
complement of any of (a) - (d) , wherein the polypeptide 

15 has an activity of the polypeptide as set forth in SEQ 
ID NO: 2, SEQ ID NO: 4 , or SEQ ID NO : 7 ; and 

(f ) a nucleotide sequence complementary to any of 
(a) -(c). 

20 3. An isolated nucleic acid molecule comprising a 

nucleotide sequence selected from the group consisting 
of: 

(a) a nucleotide sequence encoding a polypeptide 
as set forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 

25 NO:7 # with at least one conservative amino acid 
substitution, wherein the polypeptide has an activity 
of the polypeptide as set forth in SEQ ID NO:2, SEQ ID 
NO:4, or SEQ ID NO:7; 

(b) a nucleotide sequence encoding a polypeptide 
3 0 as set forth in SEQ ID NO: 2, SEQ ID NO : 4 , or SEQ ID 

NO: 7 with at least one amino acid insertion, wherein 
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the polypeptide has an activity of the polypeptide as 
set forth in SEQ ID NO: 2, SEQ ID NO: 4 , or SEQ ID NO : 7 ; 

(c) a nucleotide sequence encoding a polypeptide 
as set forth in SEQ ID NO: 2, SEQ ID NO:4, or SEQ ID 

5 . NO: 7 with at least one amino acid deletion, wherein the 
polypeptide has an activity of the polypeptide as set 
forth in SEQ ID NO: 2,. SEQ ID NO: 4, or SEQ ID- NO: 7; 

(d) a nucleotide sequence encoding a polypeptide 
as set forth in SEQ ID NO: 2, SEQ ID NO:4, or SEQ ID 

10 NO: 7 which has a C- and/or N- terminal truncation, 
wherein the polypeptide has an activity of the 
polypeptide as set forth in SEQ ID NO:2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(e) a nucleotide . sequence encoding a polypeptide 
15 as set forth in SEQ ID NO:2, SEQ ID NO:4, or SEQ. ID 

NO:7 with at least one modification selected from the 
group consisting of amino acid substitutions, amino 
acid insertions, amino acid deletions, C- terminal 
truncation, and N-terminal truncation, wherein the 
20 polypeptide has an activity of the polypeptide as set 
forth in SEQ ID NO: 2, SEQ ID NO:4, or SEQ ID NO: 7; 

(f) a nucleotide sequence of (a) - (e) comprising a 
fragment of at least about 16 nucleotides; 

(g) a nucleotide sequence which hybridizes under 
25 moderately or highly stringent conditions to the 

complement of any of (a) -(f), wherein the polypeptide 
has an activity of the polypeptide as set forth in SEQ 
ID NO: 2, SEQ ID NO:4, or SEQ ID NO : 7 ; and 

(h) a nucleotide sequence complementary to any of 
30 (a) - (e) . 
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~~ ' 4. A vector comprising the nucleic acid molecule 

of Claims 1, 2, or 3 . 

5. A host cell comprising the vector of Claim 4. 

■—5 

6. The host cell of Claim 5 that is a eukaryotic 

cell . 

7. The host cell of Claim 5 that is a prokaryotic 

10 cell. 

8. A process of producing a C3b/C4b CR-like 
polypeptide comprising cul.turing the host cell of Claim 
5 under suitable conditions to express the polypeptide, 

15 and optionally isolating the polypeptide from the 
culture. 

9. A polypeptide produced by the process of Claim 

8. 

20 

10. The process of Claim 8, wherein the nucleic 
acid molecule comprises promoter DNA other than the 
promoter DNA for the native C3b/C4b CR-like polypeptide 
operatively linked to the DNA encoding the C3b/C4b CR- 

25 like polypeptide. 

11. The isolated nucleic acid molecule according 
to Claim 2 wherein the percent identity is determined 
using a computer program selected from the group 

30 consisting of GAP , BLASTP, BLASTN, FASTA, BLASTA, 
BLASTX, BestFit, and the Smith-Waterman algorithm. 
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12. A process for determining whether a compound 
inhibits C3b/C4b CR-like polypeptide activity or 
production comprising exposing a cell according to 
'Claims 5, 6, or 7 to the compound, and measuring 

-5 C3b/C4b CR-like polypeptide activity or production in 
said cell. 

13. An isolated polypeptide comprising the amino 
acid sequence set forth in SEQ ID NO: 2 , SEQ ID NO; 4, or 

10 SEQ ID NO: 7. 

14. An isolated polypeptide comprising the amino 
acid sequence selected from the ' group consisting of: 

(a) an amino acid sequence of the mature C3b/C4b 
15 CR-like polypeptide wherein the mature polypeptide 

comprises the amino acid sequence contained within SEQ 
ID NO: 2, SEQ ID NO : 4 , or SEQ ID NO: 7, and optionally 
further comprises an amino- terminal methionine; 

(b) an amino acid sequence for an ortholog of SEQ 
2 0 ID NO: 2, SEQ ID NO:4, or SEQ ID NO: 7, wherein the 

encoded polypeptide has an activity of the polypeptide 
as set forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID 
NO : 7 ; 

(c) an amino acid sequence that is at least about 
25 70, 80, 85, 90, 95, 96, 97, 98, or 99 percent identical 

to the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 7, wherein the polypeptide has an activity 
of the polypeptide as set forth in SEQ ID NO: 2, SEQ ID 
NO: 4, or SEQ ID NO : 7 ; 

30 (d) a fragment of the amino acid sequence set 

forth in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO : 7 
comprising at least about 2 5 amino acid residues, 
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wherein the polypeptide has an activity of the 
polypeptide as set forth in SEQ ID NC:2, SEQ ID NO: 4, 
or SEQ ID NO: 7; 

(e) an amino acid sequence for an allelic variant 
5 or splice variant of either the amino acid sequence as 
set forth in SEQ ID NO : 2 , SEQ ID NO : 4 , or SEQ ID NO: 7, 
or at least one of (a) - (c) wherein the polypeptide has 
an activity of the polypeptide as set forth in SEQ ID 
NO: 2, SEQ ID NO: 4 , or SEQ ID NO : 7 . 

10 

15. An isolated polypeptide comprising the amino 
acid sequence selected from the group consisting of: 

(a) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO : 4 , or SEQ ID NO : 7 with at least' one 

15 conservative amino acid substitution, wherein ' the 
polypeptide has an activity of the polypeptide as set: 
forth in SEQ ID NO:2, SEQ ID NO:4 # or SEQ ID NO:7; 

(b) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO : 4 , or SEQ ID NO: 7 with at least one 

20 amino acid insertion, wherein the polypeptide has an 
activity of the polypeptide as set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, or SEQ ID NO:7; 

(c) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, or SEQ ID NO: 7 with at least one 

2 5 amino acid deletion, wherein the polypeptide has an 

activity of the polypeptide as set forth in SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:7; 

(d) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO : 4 , or SEQ ID NO: 7 which has a C- and/or 

3 0 N- terminal truncation, wherein the polypeptide has an 

activity of the polypeptide as set forth in SEQ ID 
N0:2, SEQ ID NO: 4, or SEQ ID NO: 7; and 
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(e) the amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, or SEQ ID NO : 7 , with at least one 
modification selected from the group consisting of 
•amino acid substitutions, amino acid insertions, amino 
-5 acid deletions, C- terminal truncation, and N-terminal 
truncation, wherein the polypeptide has an activity of 
the polypeptide as set forth in SEQ ID NO:2, SEQ ID 
NO:4, or SEQ ID NO : 7 . 

10 16. An isolated polypeptide encoded by the nucleic 

acid molecule of .Claims 1, 2, or 3. 

17. The isolated polypeptide according to Claim 14 
wherein the percent identity is determined using a 
15 computer program selected from the group consisting of 
GAP, BLASTP, BLASTN, FASTA, BLASTA, BLASTX, BestFit, 
and the Smith-Waterman algorithm. 

- 18. An antibody produced by immunizing an animal 
2 0 with a peptide comprising an amino acid sequence of SEQ 
ID NO:2, SEQ ID NO : 4 , or SEQ ID NO:7. 

19. An antibody or fragment thereof that 
specifically binds the polypeptide of Claims 13, 14, or 

25 15. 

20. The antibody of Claim 19 that is a monoclonal 
antibody. 

30 21. A hybridoma that produces a monoclonal 

antibody that binds to a peptide comprising an amino 
acid sequence of SEQ ID NO:2, SEQ ID NO: 4, or SEQ ID 
NO : 7 . 



124 



WO 02/10199 



PCT/US01/23232 



22. A method of detecting or quantitating the 
amount of C3b/C4b CR-like polypeptide using the anti- 
C3b/C4b CR-like antibody or fragment of Claims 18, 19, 

5 or 20. 

23 . A selective binding agent or fragment thereof 
that specifically binds at least one polypeptide 
wherein said polypeptide comprises the amino acid 

10 sequence selected from. the group consisting of: 

a) the amino acid sequence as set forth in SEQ ID 
N0:2, SEQ ID NO:4, or SEQ ID NO:7; and 

b) a fragment of the ^mino acid sequence set forth 
in at least one of SEQ ID NO: 2, SEQ ID NO: 4 , or 

15 SEQ ID NO: 7; and. 

c) a naturally occurring variant of (a) or (b) . 

24. The selective .binding agent of Claim 23 that . 
is an antibody or fragment thereof. 

20 

25. The selective binding agent of Claim 23 that 
is a humanized antibody. 

26. The selective binding agent of Claim 23 that 
25 is a human antibody or fragment thereof. 

27. The selective binding agent of Claim 23 that 
is a polyclonal antibody or fragment thereof. 

30 28. The selective binding agent Claim 23 that is a 

monoclonal antibody or fragment thereof. 



125 



WO 02/10199 



PCT/US01/23232 



29. The selective binding agent of Claim 23 that 
is a chimeric antibody or fragment thereof. 

30. The selective binding agent of Claim 23 that 
5 is a CDR-grafted antibody or fragment thereof. 

31. The selective binding agent of Claim 23 that 
is an antiidiotypic antibody or fragment thereof. 

10 32. The selective binding, agent of Claim 23 which 

is a variable region fragment. 

33. The variable region fragment of Claim 32 which 
is a Fab or a Fab' fragment. 

15 

34 . A selective binding agent or fragment thereof 
comprising at least one complementarity determining 
region with specificity for a polypeptide having the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, or SEQ 

20 ID NO: 7. 

35. The selective binding agent of Claim 23 which 
is bound to a detectable label. 

25 36. The selective binding agent of Claim 23 which 

antagonizes . C3b/C4b CR-like polypeptide biological 
activity. 

37. A method for treating, preventing, or 
30 ameliorating a disease, condition, or disorder 
comprising administering to a patient an effective 
amount of a selective binding agent according to Claim 
23 . 
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38. A selective binding agent produced by 
immunizing an animal with a polypeptide comprising an 

• amino acid sequence selected from the group consisting 
5 of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:7. 

39. A hybridoma that produces a selective binding 
agent capable of binding a polypeptide according to 
Claims 1, 2, or 3. 

10 

40. A composition comprising the polypeptide of 
Claims 13, 14, or 15 and a pharmaceutical ly acceptable 
formulation agent. 

15 41. The composition of Claim 40 wherein the 

pharmaceutic ally acceptable formulation agent is. a 
carrier, adjuvant, solubilizer, stabilizer, or anti- 
oxidant. 

20 42. The composition of Claim 40 wherein the 

polypeptide comprises the mature amino acid sequence 
portion of SEQ ID NO:2, SEQ ID NO: 4, pr SEQ ID NO: 7. 

43. A polypeptide comprising a derivative of the 
25 polypeptide of Claims 13, 14, or 15. 

44. The polypeptide of Claim 43 which is 
covalently modified with a water-soluble polymer. 

30 45. The polypeptide o£ Claim 44 wherein the water- 

soluble polymer is selected from the group consisting 
of polyethylene glycol, monomethoxy-polyethylene 
glycol, dextran, cellulose, poly- (N-vinyl pyrrolidone) 
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J polyethylene glycol, propylene glycol homopolymers , 
polypropylene oxide/ethyl ene oxide co-polymers, 
polyoxyethylated polyols, and polyvinyl alcohol. 

-•5 46. A composition - comprising a nucleic acid 

molecule of Claims 1, 2, or 3 and a pharmaceutically 
acceptable formulation agent. 

47. A composition of Claim 46 wherein said nucleic 
10 acid molecule is contained in a viral vector. 



48. A viral vector comprising a nucleic acid 
molecule of Claims 1, 2, or 3 . 

15. 49. A fusion polypeptide comprising the 

polypeptide of Claims 13, 14, .or . 15 fused to. a 
heterologous amino acid sequence. 

50. The fusion polypeptide of Claim 49 wherein the 

2 0 heterologous amino acid sequence is an IgG constant 

domain or fragment thereof . 

51. A method for treating, preventing or 
ameliorating a medical condition comprising 

25 administering to a patient the polypeptide of Claims 
13, 14, or 15 or the polypeptide encoded by the nucleic 
acid of Claims 1, 2, or 3. 

52. A method of diagnosing a pathological 

3 0 condition or a susceptibility to a pathological 

condition in a subject comprising: 

(a) determining the presence or amount of 

expression of the polypeptide of Claims 13, 14, or 15 
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or the polypeptide encoded by the nucleic acid molecule 
of Claims 1, 2, or 3 in a sample; and 

(b) diagnosing a pathological condition or a 
susceptibility to a pathological condition based on the 
5 presence or amount of expression of the polypeptide. 

53. A device, comprising: 

(a) a membrane suitable for implantation; and 

(b) cells encapsulated within said membrane, 
10 wherein said cells secrete a protein of Claims 13, 14, 

or 15, and wherein said membrane is permeable to said 
protein and impermeable to materials detrimental to 
said cells • 

15 54 . A method of identifying a compound which binds 

to a polypeptide comprising: 

(a) contacting the polypeptide of Claims 13, 14, 
or 15 with a compound; and 

(b) determining the extent of binding of the 
20 polypeptide to the compound. 

55. A method of modulating levels of a polypeptide 
in an animal comprising administering to the animal the 
nucleic acid molecule of Claims 1, 2, or 3 . 

25 

56. A transgenic non-human mammal comprising the 
nucleic acid molecule of Claims 1, 2, or 3 . 
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Figure 1A 

Map of Human C3b/C4b Complement Receptor like cDNA (SEQ ID NO;l) and 

Amino Acid Sequences (SEQ ID NO: 2) 

1 CCTGGGGAAGCCTCTCGGTTCCAGGAAAATGGGATGGTTGATTGCCCTAAATTGATTTTT 6 0 

61 TAAAAGAAAATTCACG7VATTGGCAGCCATAGAATAGAGTAATTTCTGTAAAGCACCAGTG 120 

121 . ATAGTGATGTTTGAATATTAATATAATGGACCAGAGGCTGTACAGTCTTTGAAAGAGGGT 180 

181 CTTGCTACCTATATATCTAGGGTTTGGCTGTTTAAAGCAGCAAGACCCTCCTTTCAGGTG 24 0 

241 GAAGTCGATGTACTTGTTGCCTTACCTAAAAGCTTTGACATTTCTCTTTCTTGCAGGCTC 300 

3 01 ACGGGATCCAGTGTTCCTGACCTCATTGTGAGCATGAGCAACCAGATGTGGCTACATCTG 3 60 

1 MSNQMWLH L 9 

3 61 CAGTCGGATGATAGCATTGGCTCACCTGGGTTTAAAGCTGTTTACCAAGAAATTGAAAAG 420 
10QSDDS I G S PG'FKAVYQE I EK 29 

4 21 GGAGGGTGTGGGGATCCTGGAATCCCCGCCTATGGGAAGCGGACGGGCAGCAGTTTCCTC 480 
30 6GCGDPG-I P.AYGKRTG SSFL 49 

4 81 CATGGAGATACACTCACCTTTGAATGCCCGGCGGCCTTTGAGCTGGTGGGGGAGAGAGTT 54 0 

• 50 HGDTLTF ECPAAFEIiVGERV 69 

541 ATCACCTGTCAGCAGAACAATCAGTGGTCTGGCAACAAGCCCAGCTGTGTATTTTCATGT 6 00 

70 I TCQQNNQWSGNKPSC VFSC 89 

601 TTCTTCAACTTTACGGCATCATCTGGGATTATTCTGTCACCAAATTATCCAGAGGAATAT 660 

90 F FNFTA S S G I.I L S P NY P E EY 109 

661 GGGAACAACATGAACTGTGTCTGGTTGATTATCTCGGAGCCAGGAAGTCGAATTCACCTA 72 0 

110 GNNMNCVWL I I SEPGSR I HL 129 

721 ATCTTTAATGATTTTGATGTTGAGCCTCAATTTGACTTTCTCGCGGTCAAGGATGATGGC 78 0 

130 I FNDFDVEPQFDFLAVKDDG 149 

781 ATTTCTGACATAACTGTCCTGGGTACTTTTTCTGGCAATGAAGTGCCTTCCCAGCTGGCC 84 0 

150 ISDITVLGT FS GNEVPSQIiA 169 

841 AGCAGTGGGCATATAGTTCGCTTGGAATTTCAGTCTGACCATTCCACTACTGGCAGAGGG 900 

170 SSGHI VRLEFQSDHSTTGRG 189 

901 TTCAACATCACTTACACCACATTTGGTCAGAATGAGTGCCATGATCCTGGCATTCCTATA 960 

190 FNI TYTT FGQNECHDPG I PI 209 

961 AACGGACGACGTTTTGGTGACAGGTTTCTACTCGGGAGCTCGGTTTCTTTCCACTGTGAT 1020 

210 NG.RRFGDRFLLGSSVSFHCD 229 

1021 GATGGCTTTGTCAJVGACCCAGGGATCCGAGTCCATTACCTGCATACTGCAAGACGGGAAC 1080 

230 DGFVKTQGSE. SXTCIXiQDGN 249 

1081 GTGGTCTGGAGCTCCACCGTGCCCCGCTGTGAAGCTCCATGTGGTGGACATCTGACAGCG 1140 

250 VVWSSTVPRCEAPCGGHLTA 269 

1141 TCCAGCGGAGTCATTTTGCCTCCTGGATGGCCAGGATATTATAAGGATTCTTTACATTGT 12 00 

270 SSGVILPPGWPGYYKDSLHC 289 

1201 GAATGGATAATTGAAGCAAAACCAGGCCACTCTATCAAAATAACTTTTGACAGATTTCAG 1260 

290 EWI I EAKPGHSIKI TFDRFQ 309 
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Figure IB 

1261 AC^GAGGTC^^TTATGAC^CCTTGGAGGTCAGAGATGGGCCAGCCAGTTCGTCCCCACTG 1320 

310 TEVNYDTLEVRDGPASSSPL 329 

1321 ATCGGCGAGTACCACGGCACCCAGGCACCCCAGTTCCTCATCAGCACCGGGAACTTCATG 1380 

330 IGEYHGTQAPQFLI STGN FM 349 

1381 TACCTGCTATTCACCACTGAOUVCAGCCGCTCCAGCATCGGCTTCCTCATCC^CTATGAG 1440 

350 YLIjFTTDNSRSS IGFLIHYE 369 

1441 AGTGTGACGCTTGAGTCGGATTCCTOCCTGGACCCGGGCATCCCTGTGAACGRCCATCXXr 1500 

j /U t> V 1 J_l u; Jb U £> V_ l_l U .f O JL XT V 1M a . n «v oob 

1501 CACGGTGGAGACTTTGGCATCAGGTCCACAGTGACTTTCAGCTGTGACCCGGGGTACACA 1560 

390 HGGDFGI RSTVTFSCDPGYT 409 

1561 CTAAGTGACGACGAGCCCCTCGTCTGTGAGAGGAACCACCAGTGGAACCACGCCTTGCCC 1620 

410 LSDDEPLVCERNHQWNHALP 429 

1621 AGCTGCGACGCTCTATGTGGAGGCTACATCCAAGGGAAGAGTGGAACAGTCCTTTCTCCT 1680 

430 SCDALCGGYI QGKSGTVL SP 449 

1681 GGGTTTCCAGATT1TTATCCAAACTCTCTAAACYGGACGTGGACGATTGAAGTGTCT 1740 

450 GFPDFYPNSLNXTWT I EVSH 469 

1741 GGGAAAGGAGTTCAAATGATCTTTCACACCTTTCATCTTGAGAGTT^ 1800 

470 GKGVQMI FHTFHLES SHDYL 489 

1801 CTGATCACAGAGGATGGAAGTTTTTCCGAGCCCGTTGCCAGGCTCACCGGGTCGGTGTTG 1860 

490 LI TBDGS FSEPVARLTGSVL 509 

1861 CCTCATACGATCAAGGCAGGCCTGTTTGGAAACTTCACTGCCCAGCTTCGGTTTATATCA 1920 

510 PHTI KAGLFGNFTAQLRF IS 529 

1921 ' GACTTCTCAATTTCGTACGAGGGCTTCAATATCACATTTTCAGAATATGACCTGGAGCCA 1980 

530 DFSISYEGFN ITFSEYDLEP 549 

1981 TGTGATGATCCTGGAGTCCCTGCCTTCAGCCGAAGAATTGGTTTTCACTTTGGTGTGGGA 2040 

550 CDDPGVPAFSRRIGFHFGVG 569 

2041 GACTCTCTGACGTTTTCCTGCTTCCTGGGATATCGTTTAGJ^AGGTGCCRCCAAGCTTACC 2100 

570 D SLTF S C FLGYRLEGAXKLT 589 

2101 TGCCTGGGTGGGGGCCGCCGTGTGTGGAGTGCACCTCTGCCAAGGTGTGTGGCCGAATGT 2160 

590 CLGGG RRVWSAPLPRCVAEC 609 

2161 GGAGCAAGTGTCAAAGGAAATGAAGGAACATTACTGTCT^ 2220 

610 GASVKGNEGTLLSPNFPSNY 629 

2221 GATAATAACCATGAGTGTATCTATAAAATAGAAACAGAAGCCGGCAAGGGGATCCACCTT 2280 

630 DNNHEC I YKI ETEAGKGI HL 649 

2281 AGAACACGAAGCTTCCAGCTGTTTGAAGGAGATACTCTAAAGGTATATGATGGAAAAGAC 2340 

650 RTRSFQLFEGDTLKVYDGKD 669 

2341 AGTTCCTCACGTCCACTGGGCACGTTCACTAAAAATGAACTTCTGGGGCTGATCCTA7VAC 2400 

670 SSSRPIiGTF TKN ELIiGLILN 689 
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Figure 1C 

2401 AGCACATCCAATCACCTRTGGCTAGAGTTCAACACCAATGGATCTGACACGGACCAAGGT 2460 

690 STS NHL* WLEFNTNG SDTDQG 709 

2461 TTTCAACTCACCTATACCAGTTTTGATCTGGTAAAATGTGAGGATCCGGGCATCCCTAAC 2520 

710 FQLTYTSFDLVKCE DPGI PN 729 

2521 TACGGCTATAGGATCCGTGATGAAGGCCACTTTACCGACACTGTAGTTCTGTACAGTTGC 2580 

730 YGYRIRDEGHFTDTVVLYSC 749 

2581 AACCCGGGGTACGCCATGCATGGCAGCAACACCCTGACCTGTTTGAGTGGAGACAGGAGA 264 0 

750 NPG YAMHGSNTLTC LSGD R R. 769 

2641 GTGTGGGACAAACCACTACCTTCGTGCATAGCGGAATGTGGTGGTCAGATCCATGCAGCC 2700 

770 VWDKPLPSCIAECGGQIHAA 789 

2701 ACATCAGGACG7y\TATTGTCCCCTGGCTATCCAGCTCCGTATGACAACAACCTCCACTGC 2760 

790 TSGR ILS PGYPAPYDN NLHC 809 

2761 ACCTGGATTATAGAGGCAGACCCAGGAAAGACCATTAGCCTCCATTTCATTGTTTTCGAC 2820 

810 T W I I EADPGKTI SLHFIVFD 829 

2821 ACGGAGATGGCTCACGACATCCTCAAGGTCTGGGACGGGCCGGTGGACAGTGACATCCTG 2880 

830 TEM AHD I LKVWDG PVDSD I . L 849 

2881 CTGAAGGAGTGGAGTGGCTCCGCCCTTCCGGAGGACATCCACAGCACCTTCAACTCACTC 2940 

850 LKE W SG SALPE D I H S TFNSL 869 

294 1 ACCCTGCAGTTCGACAGCGACTTCTTCATCAGCAAGTCTGGCTTCTCCATCCAGTTCTCC 3000 

870 TLQFDS'DFFI SKSGFSIQFS 889 

3001 ACCTC^TTGG^GCCTVCCTGTAACGATCCAGGTATGCCCCAAAATGGCACCCGCTATGGA 3 060 

890 TSI AATCNDPGMPQNGTRYG 909 

3061 GACAGCAGAGAGGCTGGAGACACCGTCACATTCCAGTGTGACCCTGGCTATCAGCTCCAA 3120 

910 DSREAGDT VTFQCD PGYQLQ 929 

3121 GGACAAGCCTU^AATCTVCCTGTGTGCAGCTGAATAACCGGTTCTTTTGGCAAC 3180 

930 GQAKITCVQLNNRFFWQPDP . 949 

3181 CCTACATGCATAGCTGCTTGTGGAGGGAATCTGACGGGCCCAGCAGGTGTTATTTTGTCA 3240 

950 PTC I AACGGNLTGPAGVI LS 969 

3241 CCCAACTACCCACAGCCGTATCCTCCTGGGAAGGAATGTGACTGGAGAGTAAAAGTGAAC 3300 

970 PNY PQPYPPGKECDWRVKVN 989 

3301 CCGGACTTTGTCATCGCCTTGATATTCAAAAGT 3360 

990 PDFVIALI FKS FNMEPSYDF 1009 

3361 CTACACATCTATGAAGGGGAAGATTCCAACAGCCCCCTCATTGGGAGTTACCAGGGCTCT 3420 

1010 LHIYEGEDSNSPLI GSYQGS 1029 

3421 CAGGCCCCAGTVAAGAATAGAGAGTAGCGGAAAC^GCCTGTTTCTGGCATTTCGGAGTGAT 3480 

1030 QAPERI ESSGNSLFLAFRSD 1049 

34 81 GCCTCCGTGGGCCTTTCAGGGTTCGCCATTGAATTTAAAGAGAAACCACGGGAAGCTTGT 3540 

1050 ASVGLSGFAI E FKE KPREAC 1069 
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3541 TTTGACCCAGGAAATATAATGAAT^ 3600 

1070 FD PGN I MNGTRVGTD FKLGS 1089 

3601 ACCATCACCTACCAGTGTGACTCTGGCTATAAGATTCTTGACCCCTCATCCATCACCTGT 3660 

1090 TI TYQCDSGYKI LDPSS ITC 1109 

3661 GTGATTGGGGCTGATGGGAAACCCTCCTGGGACCAAGTGCTGCCCTCCTC 3720 

1110 VI GADGKPSWDQVLPSCNAP 1129 

3721 TGTGGAGGCCAGTACACGGGATCAGAAGGGGTAGTTTTATCACCAAACTACCCCCATAAT 3780 

— _ — — — mw ww* ~ ~ i-. TT TT T O V\ VT V T3 U XT 11 Vl O 

J. J_ >5 U C la G W X J. Vj o r» vj v v ±j o jt a* j. *. ** 

3781 TACACAGCTCGTCAAATATGCCTCTATTCC 3840 

1150 Y TAGQICL.YSITVPKEFVVF 1169 

3841 GGACAGTTTGCCTATTTCCAGACAGCCCTGAATGATTTGGCAGAATTAT^ 3900 

1170 GQFAYFQTA LNDLAEIjFDGT 1189 

3901 CATGGACAGGCCAGACTTCTC^GCTCACTCTC^ 3960 

1190 HAQARIiLSSliSGSHSOETIiP 1209 

3961 TTGGCTACGTCAAATCAAATTCTGCTCCGATTCAGTGCAAAGAGCGGTGCCTCTGCCCGC 4020 

1210 LATSNQILLRFSAKSGASAR 1229 

4021 GGCTTCCACTTCGTGTATCAAGCTGTTCCTCGTACCAGTGACACCCAATGCAGCTCTGTC 4080 

1230 GFHFVYQAV PRTSDTQCSSV 1249 

4 081 CCCGAGCCC^GATACGGAAGGAGAATTGGTTCTGAGTTTTCTGCCGGCTCCATCGTCCGA 4140 

1250 PE PRYGRRI GSE FSAGS I VR 1269 

4141 TTCGAGTRCAACCCGGGATACCTGCT^ 4200 

1270 FE XN P GYLLQG.S TALHCQ SV 1289 

4201 CCCAACGCCTTGGCACAGTGGAACGACACGATCCCCAGCTGTGTGGTACCCTGCAGTGGC 4260 

1290 PNALAQWN DTI PSC VVPCSG 1309 

4261 . AATTTCACTC^CGAAGAGGTACAATCCTGTCCCCCGGCTACCCTGAGCCATACGGAAAC 4320 

1310 NFTQRRGTILS PGYPEPYGN 1329 

4321 AACTTGAACTGTATATGGAAGATCATAGTTACGGAGGGCTCGGGAATTCAGATCCAAGTG 4380 

1330 NLNCIWKIIVTEGSGIQIQV 1349 

4381 ATCAGTTTTGCCACGGAGCAGAACTGGGACTC^ 444 0 

1350 I S FATEQNWDSLE IHDGGDV 1369 

4441 ACCGCACCCAGACTGGGAAGCTTCTCT^GGCACCACAGTACCGGCACTGCTGAACAGTACT 4500 

1370 TAPRIiGSFSGTTVPALLNST 1389 

4501 TCCAACCAACTCTACCTGCATTTCCAGTCTGACATTAGTGTGGCAGCTGCTC 4560 

1390 SNQLYLHFQSDI SVAAAGFH 1409 

4561 CTGGAATACAAAACTGTAGGTCTTGCTGCATGCCAAGAACC^ 462 0 

1410 LEYKTVGLAACQEPALPSNS 1429 

4621 ATCAAAATCGGAGATCGGTACATGGTGAACGACGTGCTCTCCTTCCAGTGCGAGCCCGGG 4680 

1430 IKIGDRYMVNDVIiSFQCEPG 1449 
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4 6 $1 TACACCCTGCAGGGCCGTTCCCACATTTCCTGTATGCCAGGGACCGTTCGCCGTTGGAAC 4740 

1450 Yt'ZiQGRSHI SCMPGTVRRWN 1469 

4741 TATCCGTCTCCCCTGTGCATTGCAACCTGTGGAGGGACGCTGAGCACCTTGGGTGGTGTG 4800 

1470 YPSPLCIATCGGTLSTLGGV 1489 

4 8 d 1 ATCCTGAGCCCCGGCTTCCCAGGTTCTTACCCCAACAACTTAGACTGCACCTGGAGGATC 4860 

1490 ILSPGFPGSYPN NLD CTWRI 1509 

4861 TCATTACCCATCGGCTATGGTGCACATATTCAGTTTCTGAATTTTTCTACCGAAGCTAAT 4920 

1510 SLPIGYGAHIQFLNFSTEAN 1529 

49^1 CATGACTTCCTTGAAATTCAAAATGGACCTTACCACACCAGCCCCATGATTGGAC/^ATTT 4980 

1530 HDFLEIQNGPYHTSPMIGQF 1549 

4981 AGCGGCACGGATCTCCCCGCGGCCCTGCTGAGCACAACGCATGAAACCCTCATCCACTTT 5040 

1550 SGTDLiPAALliSTTHETLt I HF 1569 

5041 TATAGTGACCATTCGCAAAACCGGCAAGGATTTAAACTTGCTTACCAAGCCTATGAATTA 5100 

1570 YSDHSQNRQGFKLAYQAYEL 1589 

5101 CAG7U^CTGTCCAGATCCACCCCCATTTCAGAATGGGTACATGATCAACTCGGATTACAGC 5160 

1590 QNCPDPPPFQNGYMI NSDYS 1609 

5161 GTGGGGCAATCAGTATCTTTCGAGTGTTATCCTGGGTACATTCTAATAGGCCATCCTGTC 5220 

1610" VGQSV SFECYPGYIIilGHPV 1629 

5221 CTCACTTGTCAGCATGGGATCAACAGAAACTGGAACTACCCTTTTCCAAGATGTGATGCC 5280 

1630 LT CQHGINRNWNY PF PR CD . A 1649 

5281 CCTTGTGGGTACAACGTAACTTCTCAGAACGGC7VCCATCTACTCCCCTGGCTTTCCTGAT 5340 

1650 PCGYNVTSQNGTI YS P GFPD 1669 

5341 GAGTATCCGATCCTGAAGGACTGCATTTGGCTCATCACGGTGCCTCCAGGGCACGGAGTT 5400 

1670 EYPI IiKDCIWIiITVPPGHGV 1689 

5401 TACATCAACTTCACCCTGTTACAGACGGAAGCTGTCAACGATTAC^TTGCTGTTTGGGAC 5460 

1690 YINFTIiLQTEAVNDYIAVWD 1709 

5461 GGTCCCGATCAGAACTCACCCCAGCTGGGAGTTTTCAGTGGCAACACAGCCCTCGAAACG 5520 

1710 GPDQN S P.QLGVFS G N T A L» E T 1729 

5521 GCGTATAGCTCCACCAACCAAGTCCTGCTCAAGTTCCACAGCGACTTTTCAAATGGAGGC 5580 

1730 AYS STNQVLLKFHSDFSNGG 1749 

5581 TTCTTTGTCCTCAATTTCCACGCATTTCAGCTCAAGAAATGTCAACCTCCC 5640 

1750 FFVLNFHAFQIiKKCQPPPAV 1769 

5641 CCACAGGCAGAAATGCTTACTGAGGATGATGATTTCGAGATAGGAGATTTTGTGAAGTAC 5700 

1770 PQAEMLTEDDDFE I GDFVKY 1789 

5701 CAGTGCCACCCCGGGTACACCTTGGTGGGGACCGACATTCTGACTTGCAAGCTCAGTTCC 5760 

1790 QCHPGYTLVGTDI LTCKLSS 1809 

5761 CAGTTGCAGTTTGAGGGTTCTCTCCCAACATGTGAAGCACAATGCCCAGCAAATGAAGTC 5820 

1810 QLQFEGSLPTCEAQC PANEV 1829 
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5821 CGGACTGGATCATCGGGAGTCATTCTCAGTCCAGGGTATCCGGGTAATTATTTTAACtCC 5880 

1830 RTGSSGVI LSPGYPGNYFNS 1849 

5881 CAGACTTGCTCTTGGAGTATTAAAGTGGAACCAAACTACAACATTACCATCTTTGTGGAC 5940 

1850 QTCSWSIKVEPNYNITI FVD 1869 

5941 ACATTTCAAAGTGAAAAGCAGTTTGATGCACTGGAAGTGT 6000 

1870 TFQSEKQFDALEVFDGSSGQ 1889 

6001 AGTCCTCTGCTAGTAGTCTTAAGTGGGAATCATACTGAACAATCAAATTTTACAAGCAGG 6060 

1690 S F IiIjVVL S G NKTE Q S HPT 3 R 1903 

6061 AGTAATCAGTTATATCTCCGCTGGTCCACTGACCATGCCACCAGTAAGAAAGGATTCAAG 6120 

1910 SNQliYLRWS TDHATSKKGFK 1929 

6121 ATTCGCTATGCAGCACCTTACTGCAGTTTGACCCACCCCCTGAAGAATGGGGGTATTCTA 6180 

.1930 I R Y A A P Y C S LTHPLKH GG I L 1949 

6181 AAGAGGACTGCAGGAGCGGTTGGAAGCAAAGTGCATTATTTTTGCAAGCCTGGATACCGA 6240 

1950 NRTA GAVG S KVHYFCKPGYR 1969 

6241 ATGGTCGGCCACAGCAATGCAACCTGTAGACGAAACCCACTTGGCATGTACCAGTGGGAC 6300 

1970 MVGHSNATCRRNPIiGMYQWD 1989 

6301 TCCCTCACGCCACTCTGCCAGGCTGTGTCCTGTGGAATCCCAGAATCCCCAGGAAACGGT 6360 

1990 SLTPLCQAVSCGI PESPGNG 2009 

6361 TCATTTACCGGGAACGAGTTCACTTTGGACAGTAAAGTGGTCTATGAATGTCATGAGGGC 6420 

2010 S FTGNEFTL DSKVVYEC. H EG 2029 

6421 TTCAAGCTTGAATCGAGCCAGCAAGCAACAGCCGTC 6480 

2030 FKLESSQQATAVCQEDGLWS 2049 

6481 AACAAGGGGAAGCCGCCCACGTGTAAGCCGGTCGCTTGCCCCAGCATTGAAGCTCAGCTC 6540 

2050 NKGK PPTCKPVACPS I EAQL 2069 

6541 * TCAGAACATGTCATCTGGAGGCTGGTTTCAGGATCCTTGAATGAGTACGGTGCTCAAGTA 6600 

2070 SEHVIWRliVSGSLNEYGAQV 2089 

6601 TTGCTGAGCTGCAGTCCTGGTTACTACTTAGAAGGCTGGAGGCTCCTGCGGTGCCAGGCC 6660 

2090 LIiSCSPGYYLEGWRLIjRCQA 2109 

6661 AATGGGACGTGGAACATAGGAGATGAGAGGCCAAGCTGTCGAGTTATCTCGTGTGGAAGC 6720 

2110 NGT WNIGDERPSCRVISCGS 2129 

6721 CTTTCCTTTCCCCCAAATGGCAACAAGATTGGAACGTTGACAGTTTATG^ 6780 

2130 LSFPPNGNKIGTLTVYGATA 2149 

6781 ATATTTACGTGCAACACCGGCTACACGCTTGTGGGGTCTCATGTCAGAGAGTGCTTGGCA 6840 

2150 1 FTCNTGYTLVGSHVRECLA 2169 

6841 AATGGGCTCTGGAGCGGCAGCGAAACTCGATGTCTGGCTGGCCACTGCGGTTCCCCAGAC 6900 

2170 NGLWSGSETRCLAGHCGSPD 2189 

6901 CCGATTGTGAACGGTCACATTAGTGGAGATGGCTTCAGTTACAGAGACACGG 6960 

2190 PIVNGHI SGDGFSYRDTVVY 2209 
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6961 CAGTGCAATCCTGGTTTCCGGCTTGTGGGAACTTCCGTGAGGATATGCCTGCAAGACCAC 7020 

2210 QCNPGFRbVGTSVRI CLQDH 2229 

7021 AAGTGGTCTGGACAAACGCCTGTCTGTGTCCCCATCACATGTGGTCACCCTGGAAACCCT 7080 

2230 K WSGQTPVCVP I TC'GHPGNP 2249 

7081 GCCCACGGATTCACTAATGGCAGTGAGTTCAACCTGAATGATGTCGTGAATTTCACCTGC 7140 

2250 AHGFTNGS EFNLNDVVNFTC 2269 

7141 AACACGGGCTATTTGCTGCAGGGCGTGTCTCGAGCCCAGTGTCGGAGCAACGGCCAGTGG 7200 

2270 NTGYLIiQGVSRAQCRSNGQ W. 2289 

7201 AGTAGCCCTCTGCCCACGTGTCGAGTGGTGAACTGTTCTGATCCAGGCTTTGTGGAAAAT 7260 

2290 SSPIiPTCRVVNCSDPGFVEN 2309 

7261 GCCATTCGTCACGGGCAACAGAACTTCCCTGAGAGTTTTGAGTATGGAATGAGTATCCTG 7320 

2310 AIRHGQQ NFPESF EYG MSIL 2329 

7321 TACCATTGCAAGAAGGGATTTTACTTGCTGGGATCTTCAGCCTTGACCTGTATGGCAAAT 7380 

2330 YHCKKGFY LLGSSALTCMAN 2349 

7381 GGCTTATGGGACCGATCCCTGCCCAAGTGTTTGGCTATATCGTGTGGACACCCAGGGGTC 7440 

2350 GLWDRSLP KCIjAI SCGHPGV 2369 

7441 CCTGCCAACGCCGTCCTCACTGGAGAGCTGTTTACCTATGGCGCCGTCGTGCACTACTCC 7500 

2370 PANAVLTGELFTYGAVVHYS 2389 

7501 TGCAGAGGGAGCGAGAGCCTCATAGGCAACGACACGAGAGTGTGCCAGGAAGACAGTCAC 7560 

2390 CRGSES LI GNDTRVCQEDSH 2409 

7561 TGGAGCGGGGCACTGCCCCACTGCACAGGAAATAATCCTGGATTCTGTGGTGATCCGGGG 7620 

2410 WSGALPHCTGNNPGFCGDPG 2429 

7621 ACCCCAGCACATGGGTCTCGGCTTGGTGATGACTTTAAGACAAAGAGTCTTCTCCGCTTC 7680 

2430 T PAHGSR LGDDFKTKSLLRF 2449 

7681 TCCTGTGAAATGGGGCACCAGCTGAGGGGCTCCCCTGAACGCACGTGTTTGCTCAATGGG 7740 

2450 SCEMGHQLRGS PERTC LLNG 2469 

7741 TCATGGTCAGGACTGCAGCCGGTGTGTGAGGCCGTGTCCTGTGGCAACCCTGGCACACCC 7800 

2470 SWSGLQPVCE AVSCGNPGTP 2489 

7801 ACCAACGGAATGATTGTCAGTAGTGATGGCATTCTGTTCTCCAGCTCGGTCATCTATGCC 7860 

2490 TNGMIVSSDGILFSS SVIYA 2509 

7861 TGCTGGGAAGGCTACAAGACCTCAGGGCTC^ 7920 

2510 CWEGYKTS GLMTRHCTANGT 2529 

7 921 TGGACAGGCACTGCTCCCGACTGCACAATTATAAGTTGTGGGGATCCAGGCACACTAGCA 7980 

2530 WTGTAPDCTI I SCGDPGTLA 2549 

7981 AATGGCATCCAGTTTGGGACCGACTTCACCTTCAACAAGACTGTGAGCTATCAGTGTAAC 8040 

2550 NGIQFGTDFTFNKTVSYQCN 2569 
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8041 CCAGGCTATGTCATGGAAGCAGTC^CATCCGCCACTATTCGCTGTACCAAAGACGGCAGG 8100 

2570 PGYVMEAVTSATI RCT KDGR - 2589 

8101 TGGAATCCGAGCAAACCTGTCTGCAAAGCCGTGCTGTGTCCTCAGCCGCCGCCGGTGCAG 8160 

2590 WNP S KPVC K A V Ii . C PQ P P PVQ 2609 

8161 AATGGAACAGTGGAGGGAAGTGATTTCCGCTGGGGCTC 8220 

2610 NGTVEGSD FRWGSS I SYSCM 2629 

8221 GACGGTTAC CAGCTCTCTCACTCCGCCATC CTCTCCTGTGAAGGTCGCGGGGTGTGGAAA 8280 

2630 DGYQLSHSAIIiSCEGRGVWK 2649 

8281 GGAGAGATCCCCCAGTGTCTCCCTGTGTTCTGCGGAGACCCTGGCATCCCCGCAGAAGGG 8340 

2650 GEI PQCLPVFCGDPGI PAEG 2669 

8341 CGACTTAGTGGGAAAAGTTTC^CCTATAAGTCCGAAGTGlTCTTCCAGTGCAAATCrCCA 8400 

2670 RLSGK S FTYKSEVFFQCKSP 2689 

8401 TTTATACTCGTGGGATCCTCCAGAAGAGTCTGCGAAGCTGACGGCACGTGGAGCGGCATA 8460 

2690 FILVGS SRRVCQADGTWSGI 2709 

84 61 CAACCCACCTGCATTGATCCTGCTCATAACACCTGCCCAGACCCTGGTACGCCACACrTT 8520 

2710 QPTCIDPAHNTCPDPGTPHF 2729 

8 5 2 .1 GGAATACAGAATAGCTCCAGAGGCTATGAGGTTGGAAGCACGGTTTTTTTCAGGTGCAGA 8580 

2730 GIQNSSRGYEVGSTVFFRCR 2749 

8581 AAAGGCTACC^TATTCAAGGTTCGACGACT 8640 

2750 KGYHIQGSTTRTCIiANIiTW S 2769 

8641 GGGATACAGACCGAATGTATACCTCATGCCTGCAGACAGCCAGAAACCCCGGCACACG 8700 

2770 GIQTECIPHACRQPETPAHA 2789 

8701 GATGTGAGAGCCATCGATCTTCCTACTTTCGGCTACACCTTAGTGTACACCTGCCATCCA 8760 

2790 DVRAI DL1PTFGYTI1VYTCHP 2809 

8761 GGCTTTTTCCTCGCAGGGGGATCTGAGCACAGAACATGTAAAGCAGACATGAAATGGACA 8820 

2810 GFFLAGGSEHRTCKADMKWT 2829 

8821 GGAAAGTCGCCTGTGTGTAAAAGTAAAGGAGTGAGAGAAGTTAATGAAACAGTTACTAAA 8880 

2830 G . K S PVCKSKGVREVNETVTK 2849 

8881 ACTCCAGTTCCTTCAGATGTCTTTTTCGTCAATTCACTGTGGAAGGGGTATTATGAATAT 8940 

2850 TPVPS DVFFVNSLWKGYYEY 2869 

8 94 1 TTAGGGAAAAGACT^CCCGCCACTCTAACTGTTGACTGGTTCAATGCAACAAGC^GTAAG 9000 

2870 LGKRQPATLTVDWFNATSSK 2889 

9001 GTGAATGCCACCTTCAGCGAAGCCTCGCCAGTGGAGCTGAAGTTGACAGGCATTTACAAG 9060 

2890 VNATFSEASPVEIiKIiTGIYK 2909 

9061 AAGGAGGAGGCCCACTTACTCCTGAAAGCTTTTCAAACT 9120 

2910 KEEAHLIjLKAFQIKGQADIF 2929 

9121 GTAAGCAAGTTCGAAAATGACAACTCX3GGACTAGATGGT^ 9180 

2930 VSKFENDNWGLDGYVS S G L» E 2949 
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9 1 8 1 AGAGGAGGATTTACTTTTC AAGGTGACATTCATGGAAAAGACTTTGGAAAATTTAAGCTA 9240 

2950 RGGFTFQGDIHGKDFGKFK-L 2969 

9241 GAAAGGCAAGATCCTTTAAACCCAGATCAAGACTCTTCCAGTCATTACCACGGCACCAGC 9300 

2970 ERQDPLNPDQDSS SHYHGTS 2989 

9 3 0 1 AGTGGCTCTGTGGCGGCTGCCATTCTGGTTCCTTTCTTTGCTCTAATTTTATCAGGGTTT 9360 
2990 SGSVAAAI LVPFFALILSGF 

9361 GCATTTTACCTCTACAAACACAGAACGAGACCAAAAGTTCAATACAATGGCTATGCTGGG 9420 

3010 AFYLYKHRTRPKVQYNGYAG 3029 

942?. CATGAAAACAGCAATGGACAAGCATCGTTTGAAAACCCCATGTATGATACAAACTTAAAA 9480 

3030 HENSNGQASFENPMYDTNLK 3049 



9481 CCCACAGAAGCCAAGGCTGTGAGGTTTGACACAACTCTGAACACAGTCTGTACAGTGGTA 9540 
3050 PTEAKAVRFDTTLNTVCTVV 3069 



• 9541 TAGCCCTCAGTGCCCCAACAGGACTGATTCATAGCCATACCTCTGATGGACAAGCAGTGA 9600 

3070 * 3070 

9601 TTCCTTTGGTGCGATATACGACTCTCCCT 9660 

9661 CCTTGTCTACTGGC ATAAGTGC AGCGGGGATCTCT ACTCAAATGTGTCAGGGTCTTCTAC 9720 

9721 GGATCAAACTACACATGCGTTTTCATTCCAAAAGTGGGTTCTAAATGCCTGGCTGCATCT 9780 

9781 GTATGAAATCAAGGCACACTCCAGGAAGACTGCCACGTCGCGCCAACACGTCATACTCAA 9840 

9841 TRCCTCAGACTTTCATATTTCTGTGTTGCTGAGATGCCTTTCAATGCAATCGTCTGGGCT 9900 

9901 CGTGGATATGTCCCTCAGGTGCGGTGA(^GAATGGTGGCACCACGATATGTGTTCTCTTG 996 0 

9961 TGTTGTTTTTCCTTTTTAAACCCCCATGAACACGAATACTCTGAAAAAAATAAAAAGCTT 10020 

10021 TCTGGAAGAAGACACCTTTCTGATAGAGGCTCACACCTACAAATGCTTCACTCTGTCCTT 10080 

10081 CCGAGACCTGACAAGCTTTGAGGACCTCACAGCTCCCCTGTGTGTTCATCTCTAGGGATG 10140 

10141 TTTGCAATTTCCCAGTCAGCTGTTCTGTCGCAGAATGTTTAATGCACAATTTTTTGCACT 10200 

10201 AGTGTGTTATG AATGACTAAGATTCTGATAAAAAAAATAAATTATTTACAC AGGGTTT AT 10260 

10261 ACACACTATCCATTGTATATAAGCATTATTTCATATTATCAAGCTAAACATTCCCCCATC 10320 

10321 AGCTTAGTTGGAGTGTTAGGGAAAAGTATTCCTAGATATGGCACAGATTTTAAAAGGAAA 10380 

10381 TACAGTATTGACGAGATTTATTTTATTATTGCTTCAATTAGCTCCATTTACGTGTTGAAT 10440 

10441 TCATTGAAG AGGTCCAATGAGAAAAAAAC AGAAGCCTCCTTATTTCACACGTTTTCCTCC 10500 

10501 TTTAGTACCATCCTCATCCAATTACTGTCTCTCTGATACTACTTAATAGCAGGGGGTTTG 10560 

10561 CAGAAATTTCTGTTTGCCATGTAAAACTGTGAATAGTAATTTATTTTAGATAGTCGATGA 10620 

10621 ACTTGTGGGTTTTAGCTCACAATGCAGCCTTCCCTTTTGCAGTGTTTTTTTTT 10673 
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Figure 2A 

Map of Second Human C3b/C4b Complement Receptor like cDNA (SEQ ID 
*~ NO; 6) and Amino Acid Sequences (SEQ ID NO: 7) 



1 ACCCTGACGGTTGGTGATGCTGGGAAGGTGGGAGACACCAGATCGGTCTTGTACGTGCTC 6 0 
lTLTVGDAGKVGDTRSVLYVL -20 

6 1 ACGGGATGCAGTGTTCCTGACCTCATTGTGAGCAT 120 

21TGS SVPDLIVSMSNQM WLHL 40 

121 CAGTCGGATGATAGCATTGGCTCACCTGGGTTT^ 180 

41QSDDS I GSPGFKAVYQEI EK 60 

181 GGAGGGTGTGGGGATCCTGGAATCCCCGCCTATGGGAAGCGGACGGGCAGCAGTTTCCTC 24 0 

61 GGC GD P G I PAYG KRTG S S FL 80 

241 C^TGGAGATACACTCACCTTTGAATGCCCGGCGGCCTTTGAGCTGGTGGGGGAGAGAGTT 300 

81HGDTLTFE .CPAAFE L V G E R V 100 

3 01 ATCACCTGT(^GCAGAACAATCAGTGGTCTGGCAACAAGCCCAGCTGTGTATTTTCATGT 360 

101 ITCQQNNQWSGNKPSCVFSC 120 

361 TTCTTGAACTTTACGGCATCATCTGGGAT^ 420 

121 FFNFTASS'GI I LSPNYPEEY 140 

421 GGGAACAACATGAACTGTGTCTGGTTGATTATCTCGGAGCCAGGAAGTCGAATTCACCTA 4 80 

141 GNNMNC.VWLI I SEPGSRIHI* 160 

481 ATCTTTAATGATTTTGATGTTGAGCCTCAATTTGACTTTCTCGCGGTCAAGGATGATGGC 540 

161 I FN D FDVE P Q FD FLAV KDDG 180 

541 ATTTCTGACATAACTGTCCTGGGTACTTTTTCTGG 600 

181 I SD I TVLGTFSGNEVPSQLA 200 

601 AGCAGTGGGCATATAGTTCGCTTGGAATTTCAGTCTGACCATTCCACTACTGGCAGAGGG 660 

201 S S G H I V'RLE F Q S DH S T TG RG 220 

661 TTCAACATGACTTACACCACATTTGGTCAGAATGAGTGCCATGATCCTGGCATTCCTATA 720 

221 FNI TYTTFGQNECHDPGI PI 240 

721 AACGGACGACGTTTTGGTGACAGGTTTCTACTCGGGAGCTCGGTTTCTTTCCACTGTGAT 780 

241 NGRRFGDRFLLGSSVSFHCD 260 

781 GATGGCTTTGTCAAGACCCAGGGATCCGAGTCCATTACCTGCATACTGCAAGACGGGA^ 840 

261 DGFVKTQGSESI TCI LQDGN 280 

841 GTGGTCTGGAGCTCC^CCGTGCCCCGCTGTGAAGCTCCATGTGGTGGACATCTC 900 

281 VVWSSTVPRCEAPCGGHLTA . 300 

901 TCCAGCGGAGTC^TTTTGCCTCCTGGATGGCCAGGATATTATAAGGATTCTTTACATTGT 960 

301 SSGVILPPGWPGYYKDSLHC 320 

961 GAATGGATAATTGAAGCAAAACCAGGCCACTCTATCAAAATA 1020 

321 EWI IEAKPGHSI KITFDRFQ 340 

1021 ACAGAGGTC^TTATGACACCTTGGAGGTCAGAGATGGGCCAGCCAGTTCGTCCCCACTG 1080 

341 TEVNYDTLEVRDGPAS SS PL 360 
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Figure 2B 

» 

1 0 81 ATCGGCGAGTACCACGGCACCCAGGCACCCCAGTTCCTCATCAGCACCGGGAACTTCATG 1140 

361 I GEYHGTQAPQFLI STGNFM 380 

1141 TACCTGCTATTCACCACTGACAACAGCCGCTCCAGCATCGGCTTCCTCATCCACTATGAG 1200 

381 YLLFTTDNSRSS IGFLIHYE 400 

1201 AGTGTGACGCTTGAGTCGGATTCCTGCCTGGACCCGGGCATCCCTGTGAACGRCCATCGC 1260 

401 SVTIiESDS CLDPGIPVNXHR 420 

1261 CACGGTGGAGACTTTGGCATCAGGTCCACAGTGACTTTCAGCTGTGACCCGGGGTACACA 1320 

421 HGGDFGI RSTVTFSCD PGYT 440 

1 3 21 CTAAGTGACGACGAGCCCCTCGTCTGTGAGAGGAACCACCAGTGGAACCACGCCTTGCCC 1380 

441 LSDDEPLVCERNHQWNHALP 460 

1381 AGCTGCGACGCTCTATGTGGAGGCTACATCCAAGGGAAGAGTGGAACAGTCCTTTCTCCT 1440 

461 SCDALiCGGY IQGKSGTVLSP 480 

1441 GGGTTTCCAGATTTTTATCCAAACTCTCTAAACYGCACGTGGACCATTGAAGTGTCTCAT 1500 

481 GFPDFYPNSLNXTWTIEVSH 500 

1501 GGGAAAGGAGTTCAAATGATCTTTCACACCTTTCATCTTGAGAGTTCCCACGACTATTTA 1560 

501 GKGVQMI FHTFHIiESSHDYL 520 

1561 CTGATCACAGAGGATGGAAGTTTTTCCGAGCCCGTTGCCAGGCTCACCGGGTCGGTGTTG 1620 

521 LITEDGSFSEPVARLTGSVL 540 

1621 CCTCATACGATCAAGGCAGGCCTGTTTGGAAACTTCACTGCCCAGCTTCGGTTTATATCA 1680 

541 PHTI KAGLFGNFTAQLRFIS 560 

1681 GACTTCTCAATTTCGTACGAGGGCTTCAATATCACATTTTCAGAATATGACCTGGAGCCA 1740 

561 DFSI SYEGFNITFSEYDLEP 580 

1741 TGTGATGATCCTGGAGTCCCTGCCTTCAGCCGAAGAATTGGTTTTCACTTTGGTGTGGGA 1800 

581 CDDPGVPAFSRRIGFHFGVG 600 

1801 GACTCTCTGACGTTTTCCTGCTTCCTGGGATATCGTTTAGAAGGTGCCRCCAAGCTTACC 1860 

601 DSLTFSCFIiGYRLEGAXKLT 620 

1861 TGCCTGGGTGGGGGCCGCCGTGTGTGGAGTGCACCTCTGCCAAGGTGTGTGGCCGAATGT 1920 

621 CliGGGRRVWSAPLPRCVAEC 640 

1921 GGAGCAAGTGTCAAAGGAAATGAAGGAACATTACTGTCTC 1980 

641 GASVKGNEGTLLS PNFPSNY 660 

1981 GATAATAACCATGAGTGTATCTATAAT^TAGAAACAGAAGCCGGCAAGGGCATCGACCTT 2040 

661 DNNHECIYKI ETEAGKGIHL 680 

2041 AGAACACGAAGCTTCCAGCTGTTTGAAGGAGATACTCTAAAGGTATATGATGGAAAAGAC 2100 

681 RTRS FQLFEGDTLKVYDGKD 700 

2101 AGTTCCTCACGTCCACTGGGCACGTTCACTAAAAATGAACTTCTGGGGCTGATCCTA7^AC 2160 

701 SSSRPLGTFTKNELLGLILN 720 

2161 AGCACATCCAATCACCTRTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGT 222 0 

721 STSNHLWLEFNTNGSDTDQG 740 
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2221 TTTCAACTCACCTATACCAGTTTTGATCTGGTAAAATGTGAGGATCCGGGCATCCCTAAC 2280 

741 FQLTYTS FDLVKCED PG .I PN 760 

2281 TACGGCTATAGGATCCGTGATGAAGGCCACTTTACCGACACTGTAGTTCTGTACAGTTGC 2340 

761 YGYR I RDBGH FTDTVVLY SC 780 

■ 2341 AACCCGGGGTACGCCATGCATGGCAGCAACACCCTGACCTGTTTGAGTGGAGACAGGAGA 2400 

781 NPGYAMHGSNTLTCLSGDRR 800 

24 01 GTGTGGGACAAACCACTACCTTCGTGCATAGCGGAATGTGGTGGTCAGATCCATGCAGCC 2460 

801 VWDKPLPSCIAECGGq'i HAA 820 

24 61 ACATCAGGACGAATATTGTCCCCTGGCTATCC^^ 2520 

821 TSGRILSPGYPAPYDNNLHC 840 



2521 ACCTGGATTATAGAGGCAGACCC^GGAAAGACCATTAGCCTCCATTTCATTGTTTTCGAC 2580 

841 TWI IEADPGKTISLHFIVFD 860 

2581 ACGGAGATGGCTCACGACATCCTCAAGGTCTGGGACGGGCCGGTGGACAGTGACATCCTG 2640 

861 TEMAH DI LKVWDGPVDS D I L 880 

2641 CTGAAGGAGTGGAGTGGCTCCGCCCTTCCGGAGGACATCCACAGCACCTTCAACTCACTC 2700 

881 LKEWSGSALPEDIHSTFNSL 900 

2701 ACCCTGCAGTTCGACAGCGACTTCTTCATCAGCAAGTCTGGCTTCTCCATCCAGTTCTCC 2760 

901 TLQFDSDFFI SKSGFSI QFS 920 

2761 ACCTCAATTGCAGCCACCTGTAACGATCCAGGTATGCCCCAAAATGGCACCCGCTATGGA 2820 

921 TS I AAT CND PGMPQNGTRYG 940 

2821 GACAGCAGAGAGGCTGGAGACACCGTCACATTCCAGTGTGACCCTGGCTATCAGCTCCAA 2880 

941 DSRE. AGDTVTFQCD. PGYQLQ 960 

2881 GGACAAGCCAAAATCACCTGTGTGCAGCTGAATAACCGGTTCTTTTGGCAACCAGACCCT 2940 

961 GQAKITCVQLNNRFFWQPDP 980 

2941 CCTACATGCATAGCTGCITGTGGAGGGAATCTGACGG 3000 

981 PTCIAACGGNLTGPAGVI LS 1000 

.3001 CCCAACTACCCACAGCCGTATCCTCCTGGGAAGGAATGTGACTGGAGAGTAAAAGTGAAC 3060 

1001 PNYPQPYPPGKECDWRVKVN 1020 

3061 CCGGACTTTGTCATCGCCTTGATATTCT^AAAGTTTCAACATGGAGCCCAGCTAT 3120 

1021 PDFVIALI FKSFNMEPSYDF 1040 

3121 CTACACATCTATGAAGGGGAAGATTCCAACAGCCCCCT 3180 

1041 LHIYEGEDSNSPIilGSYQGS 1060 

3181 CAGGCCCCAGAAAGAATAGAGAGTAGCGGAAAC^GCCTGTTTCTGGCATTTCGGAGTGAT 3240 

1061 QAPERIESSGNSIaFLAFRSD - 1080 

3241 GCCTCCGTGGGCCTTTCAGGGTTCGCCAT^ 3300 

1081 ASVGLSGFAI EFKEKPREAC 1100 

33 01 TTTGACCCAGGAAATATAATGAATGGGACAAGAGTT 3360 

1101 FDPGNIMNGTRVGTDFKLGS 1120 



3361 
1121 



ACCATCACCTACCAGTGTGACTCTGGCTATAAGATTCTTGACCCCTCATCCATCACCTGT 3420 
TI TYQCDSGYKILDPSS I TC 1140 
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3421 GTGATTGGGGCTGATGGGAAACCCTCCTGGGACCAAGTGCTGCCCTCCTGCAATGCTCCC 3480 

1141 VIGADGKPSWDQVLPSCNAP 1160 

34 81 TGTGGAGGCCAGTACACGGGATCAGAAGGGGTAGTTTTATCACCAAACTACCCCCATAAT 3540 

1161 CGGQYTGSEGVVLS PNYPHN 1180 

3541 TACACAGCTGGTCAAATATGCCTCTATTCCATCACGGTACCAAAGGAATTCGTGGTCTTT 3600 

1181 Y TA G Q I C L Y S I TV P KE.FV V P 1200 

36 01 GGACAGTTTGCCTATTTCCAGACAGCCCTGAATGATTTGGCAGAATTATTTGATGGAACC 3660 

1201 GQFAYFQTALNDLAELFDGT 1220 

3661 CATGCACAGGCCAGACTTCTCAGCTCACTCTCGGGGTCTC7VCTCAGGGGAAACATTGCCC 3720 

1221 HAQARLLSSLSGSHSGETLP 1240 

3721 TTGGCTACGTCT^AATCAAATTCTGCTCCGATTCAGTGCAAAGAGCGGTGCCTCTGCCCGC 3780 

1241 L ATSNQI LLRFSAKSGASAR 1260 

3781 GGCTTCCACTTCGTGTATCAAGCTGTTCCTCGTACCAGTGACACCCAATGCAGCTCTGTC 3840 

1261 GFHFVYQAVPRTSDTQCS SV 1280 

3841 CCCGAGCCCAGATACGGAAGGAGAATTGGTTCTGAGTTTTCTGCCGGCTCCATCGTCCGA 3900 

1281 P E PR YGRR I GS E F SAG S I VR 1300 

3901 TTCGAGTRCAACCCGGGATACCTGCTTCAGGGTTCCACGGCGCTCCACTGCCAGTCCGTG 3960 

1301 FEXNPGYLLQGSTAL.HCQSV 1320 

3961 CCCAACGCCTTGGCACAGTGGAACGACACGATCCCCAGCTGTGTGGTACCCTGCAGTGGC 4020 

1321 PNALAQWND T I PSCVVPC SG 1340 

4021 AATTTCACTCAACGAAGAGGTACAATCCTGTCCCCCGGCTACCCTGAGCCATACGGAAAC 4080 

1341 NFTQRRGTI LSPGYPEPYGN 1360 

4081 AACTTGAACTGTATATGGAAGATCATAGTTACGGAGGGCTCGGGAATTCAGATCCAAGTG 4140 

1361 NLNCIWKI IVT EGSGIQI QV 1380 

4141 ATCAGTTTTGCCACGGAGCAGAACTGGGACTCCCTTGAGATCCACGATGGTGGGGATGTG 4200 

1381 I S FATEQNWDSLEI HDGGDV 1400 

4201 ACCGCACCCAGACTGGGAAGCTTCTCAGGCACCACAGTACCGGCACTGCTGAACAGTACT 4260 

1401 TAP RLGS FS GTTVPALLN ST 1420 

4261 TCCAACCAACTCTACCTGCATTTCCAGTCTGACATTAGTGTGGCAGCTGCTGGTTTCCAC 4320 

1421 S NQLYLH FQSDI SVAAAG FH 1440 

4321 CTGGAATACAAAACTGTAGGTCTTGCTGCATGCCAAGAAC 4380 

1441 LEYKTVG LAACQEPA LPS NS 1460 

4381 ATCAAAATCGGAGATCGGTACATGGTGAACGACGTGCTCTCCTTCCAGTGCGAGCCCGGG 4440 

1461 I KIGDRYMVNDVLS FQCE PG 1480 

4441 TACACCCTGCAGGGCCGTTCCCACATTTCCTGTATGCCAGGGACCGTTCGCCGTTGGAAC 4500 

1481 YTLQGRSH I SCMPGTVRRWN 1500 

4501 TATCCGTCTCCCCTGTGCATTGCAACCTGTGGAGGGACGCTGAGCACCTTGGGTGGTGTG 4560 

1501 YPSPLC IATCGGTLSTLGGV 1520 
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4 5 6 i ATCCTGAGCCCCGGCTTCCCAGGTTCTTACCCCAACAACTTAGACTGCACCTGGAGGATC 4620 

1521 ILSPGFPGSYPNNLDCTWRI 1540 

4621 TCATTACCCATCGGCTATGGTGCACATATTCA 4680 

1541 SLPIGYGAHIQFLNFSTEAN 1560 

4681 CATGACTTCCTTGAAATTCAAAATGGACCTTACCA 4740 

1561 HDFLEI QN GPYHTSPMI GQF 1580 

4741 AGCGGCACGGATCTCCCCGCGGCCCTX^CTGAGC^ 4800 

1581 SGTDLPAALLSTTHETIiI HF 1600 

4 8 0 i TATAGTGACCATTCGCAAAACCGGCAAGGATTTAAAC 4860 

1601 YSDHSQNRQGFKLAYQAYEL 1620 

4861 CAGAACTGTCCAGATCCACCCCCATTTCAGAATGGGTACATGATCAACTCGGATTACAGC 4920 

1621 QNCPDPPP FQNGYMINSDYS 1640 

4921 GTGGGGCAATCAGTATCITTCGAGTGTTATCCTGGGTACATTCTAATAGG 4 980 

1641 VGQSVSFECYPGYILIGHPV 1660 

4981 CTCACTTGTCAGCATGGGATCAACAGAAACTGGAACTACCCTTTTCCAAGATGTGATGCC 5040 

1661 •!» T C Q H O I NRNWNYPF P R. C DA 1680 

5041 CCTTGTGGGTACAACGTAACTTCTCAGAACGGCACC ATCTACTCCCCTGGCTTTCCTGAT 5100 

1681 PCGYNVTS QNGTI YS PGF PD 1700 

5101 GAGTATCCGATCCTGAAGGACTGCATTTGGCTO 5160 

1701 EYP I LKDC I WLI TVP PGHGV 1720 

5161 TACATCAACTTCACCCTGTTACAGACGGAAGCTGTC7UICGATTACATTGCTGTTTGGGAC 5220 

1721 YINFTLIiQTEAVNDYIAVWD 1740 

5221 GGTCCCGATCAGAACTCACCCCAGCTGGGAGTTTTCAGTGGCAACACAGCCCTCGAAACG 5280 

1741 GPDQNS PQLGVFSGNTAL ET 1760 

5281 GCGTATAGCTCCACCAACCAAGTCCTGCTCy^GTTCCACAGCGACTTTTCAAATGGAGGC 5340 

1761 AYSSTNQVLLKFHSDFSNGG 1780 

5341 TTCTTTGTCCTCAATTTCCACGCATTTCAGCTCAAGAAATGTCAACCTCCCCCAGCGGTT 5400 

1781 FFVL N FHA FQL KKCQ P P P AV 1800 

5401 CCACAGGCAGAAATGCTTACTGAGGATGATGAT^ 5460 

1801 PQAEMLTE DDDFEIGDFVKY 1820 

5461 CAGTGCCACCCCGGGTACACCTTGGTGGGGACCGACATTCTGACTTGCAAGCTCAGTTCC 5520 

1821 QCHPGYTLVGTDI LTCKLSS 1840 

5521 CAGTTGC^GTTTGAGGGTTCTCTCCCAACATGTGAAGCACAATGCCCAGCAAA 5580 

1841 QLQFEGSLPTCEAQCPANEV 1860 

5581 CGGACTGGATCATCGGGAGTCATTCTCAGTCCAGGGTATCCGGGTAAT^ 5640 

1861 RTGSSGVI LSPGYPGNYFNS 1880 

5641 (^GACTTGCTCTTGGAGTATTAAAGTGGAACCAA^ 5700 

1881 QTCSWSIKVEPNYNITI FVD 1900 
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.570} ACATTTCAAAGTGAAAAGCAGTTTGATGCACTGGAAGTGTTTGATGGTTCTTCTGGGCAA 5760 

1901 TFQSE KQFDALEVFDGSSGQ 1920 

5761 AGTCCTCTGCTAGTAGTCTTAAGTGGGAATCATACTGAAGAATCAAATTTTACAAGCAGG 5820 

1921 SPLLVVLSGNHTEQSNFTSR 1940 

5 8 2 1 AGTAATCAGTTATATCTCCGCTGGTCCACTGACCATGCCACCAGTAAGAAAGGATTCAAG 5880 

1941 SNQLYLRWSTDHATSKKGFK 1960 

5881 ATTCGCTATGCAGCACCTTACTGCAGTTTGACCCACCCCCTGAAGAATGGGGGTATTCTA 5940 

1961 I R Y A A P Y C SIiTH P.LKNGG I L 1980 

5941 AACAGGACTGCAGGAGCGGTTGG AAGCAAAGTGC ATTATTTTTGCAAGCCTGGATACCGA 6000 

1981 NRTAGAVGSKVHYFC KPGYR 2000 

6001 ATGGTCGGCC7VCAGCAATGCAACCTGTAGACGAAACCCACTTGGCATGTACCAGTGGGAC 6060 

2001 MVGH S NATCR RNPLGMYQWD 2020 

6061 TCCCTCACGCCACTCTGCCAGGCTGTGTCCTGTGGAATCCCAGAATCCCCAGGAAACGGT 6120 

2021 SLTPLCQAVSCGI PESPGNG 2040 

6121 TCATTTACCGGGAACGAGTTCACTTTGGACAGTAAAGTGGTCTATGAATGTCATGAGGGC 6180 

2041 SFTGNE FTLDSK VVY ECHEG 2060 

6181 TTCAAGCTTGAATCCAGCCAGCAAGCAACAGCCGTGTGTCAAGAAGATGGGCTGTGGAGT 6240 

2061 FKLE SSQ QATA VCQEDGIiWS 2080 

6241 AACAAGGGGAAGCCGCCCACGTGTAAGCCGGTCGCTTGCCCCAGCATTGAAGCTCAGCTC 6300 

2081 NK GKP PTCKPVACPS I EAQL 2100 

63 01 TCAGAACATGTCATCTGGAGGCTGGTTTCAGGATCCTTGAATGAGTACGGTGCTCAAGTA 6360 
2101 SEH-VIWRLVSGSLNEYGAQV 2120 

6361 TTGCTGAGCTGCAGTCCTGGTTACTACTTAGAAGGCTGGAGGCTCCTGCGGTGCCAGGCC 6420 

2121 LLSCS PGYYLEGWRLLRCQA 2140 

6421 AATGGGACGTGGAACATAGGAGATGAGAGGCCAAGCTGTCGAGTTATCTCGTGTGGAAGC 6480 

2141 NGTWN I GDERPSCRVI S CGS 2160 

64 81 CTTTCCTTTCCCCCAAATGGCAACAAGATTGGAACGTTGACAGTTTATGGGGCCACAGCT 6540 
2161 LS F P PNGNKI GTLTVYGATA 2180 

6541 ATATTTACGTGCAACACCGGCTACACGCTTGTGGGGTCTCATGTCAGAGAGTGCTTGGCA 6600 

2181 IFTCNTGYTLVGSHVRECLA 2200 

6601 AATGGGCTCTGGAGCGGCAGCGAAACTCGATGTCTGGCTGGCCACTGCGGTTCCCCAGAC 6660 

2201 NG LWSGSETRCLAGHCGSPD 2220 

6661 CCGATTGTGAACGGTCACATTAGTGGAGATGGCTTCAGTTACAGAGACACGGTGGTTTAC 6720 

2221 PIVNGHI SGDGFSYRDTVVY 2240 

6721 CAGTGCAATCCTGGTTTCCGGCTTGTGGGAACTTCCGTGAGGATATGCCTGCAAGACCAC 6780 

2241 QCNPGFRLVGTSVRI CLQDH 2260 

6781 AAGTGGTCTGGACAAACGCCTGTCTGTGTCCCCATCACATGTGGTCACCCTGGAAACCCT 6840 

2261 KWSGQTPVCVPITCGHPG NP 2280 
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6841 GCCCACGGATTCACTAATGGCAGTGAGTTCAACCTGAATGATGTCGTGAATTTCACCTGC 6900 

2281 AHGF TNGSEFNLNDVVNFTC 2300 

6901 AACACGGGCTATTTGCTGCAGGGCGTGTCTCGAGCCCAGTGTCGGAGCAACGGCCAGTGG 6960 

2301 NTGYbLQGVSRAQC R SNGQW 2320 

6961 AGTAGCCCTCTGCCCACGTGTCGAGTGGTGAACTGTTCTGATCC AGGCTTTGTGGAAAAT 7020 

2321 SS PLPTCRVVNCSDPGFVEN 2340 

7021 GCCATTCGTCACGGGCAACAGAACTTCCCTGAGAGTTTTGAGTATGGAATGAGTATCCTG 7080 

-4 It T- *-» »*■ /~\ »<T ^> T> T?» O T7» T7» V «« O T T 1 A 

zj4i ax it. n o V w v jt o e> r j. »'J o j. • jj. i-»uu 

7081 TACCATTGC7VAGAAGGGATTTTACTTGCTGGGATCTTC AGCCTTGACCTGTATGGCAAAT 7140 

2361 YHCKKGFYLIiGSSALTCMAN 2380 

7141 GGCTTATGGGACCGATCCCTGCCCAAGTGTTTGGCTATATCGTC 72 00 

2381 GLWD RS L P KCLA I S C G H P GV 2400 

7201 CCTGCCAACGCCGTCCTGACT*GGAGAGCTGTTTACCTATGGCGCCGTCGTGCACTACTCC 7260 

2401 PANAVLTG BLFTYGAVVHYS 2420 

7261 TGCAGAGGGAGCGAGAGCCTCATAGGC7^ACGACACGAGAGTGTGCCAGGAAGACAGTCAC 7320 

2421 CRGSESLIGNDTRV.CQEDSH 2440 

7321 TGGAGCGGGGCACTGCCCCACTGCACAGGAAATAATCCTGGATTCTGTGGTGATCCGGGG 7380 

2441 WSGAIjPHCTGNNPGFCGDPG 2460 

7381 ACCCCAGCACATGGGTCTCGGCTTGGTGATGACTTTAAGACAT^AGAGTCTTCTCCGCTTC 7440 

2461 TPAHGS RLGDDFK TKSLLRF 2480 

7441 TCCTGTGAAATGGGGCACCAGCTGAGGGGCTCCCCTGAACGCACGTGTTTGCTCAATGGG 7500 

2481 SCEMGHQLRGSPERTCIiliNG 2500 

7501 TCATGGTCAGGACTGCAGCCGGTGTGTGAGGCCGTGTCCTGTGGCAACCCTGGCACACCC 7560 

2501 SWSGLQ PVC EAVS CGN PGTP 2520 

7561 ACCAACGGAATGATTGTCAGTAGTGATGGCATTCT^ 7620 

2521 TNGMIVSSDGILFSSSVIYA 2540 

7621 TGCTGGGAAGGCTACAAGACCTCAGGGCTCATGAC^ 7680 

2541 CWEGYKTSGLMTRHCTANGT 2560 

7681 TGGACAGGCACTGCTCCCGACTGCACAATTATAAGTTGTGGGGATCCAGGCACACTAGCA 7740 

2561 WTGTA PDCTI ISCGDPGTLA 2580 

7741 AATGGCATCCAGTTTGGGACCGACTTCAC^ 7800 

2581 NGIQFGTDFTFNKTVSYQCN 2600 

7801 CC^GGCTATGTCATGGAAGC^GTCACATCCGCCACTATTCGCTGTACCAAAGACGGCAGG 7860 

2601 PGYVMEAVTSATI RCTKDGR 2620 

7861 TGGAATCCGAGC7^AACCTGTCTGCAAAGCCGTGCTGTGTCCTCAGCCGCCGCCGGTGCAG 7920 

2621 WNPSKPVCKAVLCPQPPPVQ 2640 

7921 AATGGAACAGTGGAGGG AAGTGATTTC CGCTGGGGCTC CAG CATAAGTTACAGCTGCATG 7980 

2641 NGTVEGSDFR WGS S I SYS CM 2660 
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7981 GACGGTTACCAGCTCTCTCACTCCGCCATCCTCTCCTGTG7VAGGTCGCGGGGTGTGGAAA 8040 

2661 DGYQLSHSAI LSCEGRGVWK 2680 

8041 GGAGAGATCCCCCAGTGTCTCCCTGTGTTCTGCGGAGACCCTGGCATCCCCGCAGAAGGG 8100 

2681 GE I PQCLPVFCGDPGI PAEG 2700 

8101 CGACTTAGTGGGAAAAGTTTCACCTATAAGTCCGAAGTCTTCTTCCAGTGCAAATCTCCA 8160 

2701 RIiS" GKS FTYKSEVFFQ.CKSP 2720 

8161 TTTATACTCGTGGGATCCTCCAGAAGAGTCTGCCAAGCTGACGGCACGTGGAGCGGCATA 8220 

2721 FILVGSSRRVCQADGTWSGI 2740 

8221 CAACCCACCTGCATTGATCCTGCTCATAACACCTGCCCAGACCCTGGTACGCCACACTTT 8280 

2741 QPTCIDPAHNTCPDPGTPHF 2760 

8281 GGAATACAGAATAGCTCCAGAGGCTATGAGGTTGGAAGCACGGTTTTTTTCAGGTGCAGA 8340 

2761 G I 0 N S S RGY EVG S TV FFRCR 2780 

8341 AAAGGCTACCATATTCAAGGTTCCACGACTCX3CACCTGCCTTGCCAATTTAACATGGAGT 8400 

2781 KGYHIQGSTTRTCLANLTWS 2800 

8401 GGGATACAGACCGAATGTATACCTCATGCCTGCAGACAGCCAGAAACCCCGGCACACGCG 8460 

2801 GI QTECI PHACRQPETPAHA 2820 

8461 GATGTGAGAGCCATCGATCTTCCTACTTTCGGCTACACCTTAGTGTACACCTGCCATCCA 852 0 

2821 DVRA I DIiPTFGYTLVYTCHP 2840 

8521 GGCTTTTTCCTCGCAGGGGGATCTGAGCACAGAACATGTAAAGCAGACATGAAATGGACA 8580 

2841 GFFLAGGSE HRTCKADMKWT 2860 

8581 GGAAAGTCGCCTGTGTGTAAAAGTAAAGGAGTGAGAGAAGTTAATGAAACAGTTACTAAA 8640 

2861 GKSPVCKSKGVREVNETVTK 2880 

8641 ACTCCAGTTCCTTCAGATGTCTTTTTCGTCAATTCACTGTGGAAGGGGTATTATGAATAT 8700 

2881 TPVPSDVFFVNSLWKGYY.EY 2900 

8701 TTAGGGAAAAGAC^CCCGCCACTCTAACTGTTGACTGGTTCAATGCAACAAGCAGTAAG 8760 

2901 LGKRQPATLTVDWFNATSSK 2920 

8761 GTGAATGCCACCTTCAGCGAAGCCTCGCCAGTGGAGCTGAAGTTGACAGGCATTTACAAG 8820 

2921 VNATFSEAS PVELKIiTGIYK 2940 

8821 AAGGAGGAGGCCCACTTACTCCTGAAAGCTTTTCAAATTAAAGGCCAGGCAGATATTTTT 8880 

2941 KE EAH LLL KAFQ I KGQADI F 2960 

8881 GTAAGCAAGTTCGAAAATGACAACTGGGGACTAGATGGTTATGTGTCATCTGGACTTGAA 8940 

2961 VSKFENDNWGLDGYVSSGLE 2980 

8941 AGAGGAGGATTTACTTTTCAAGGTGACATTCATGGAAAAGACTTTGGAAAATTTAAGC 9000 

2981 RGGFTFQGDIHGKDFGKF KL 3000 

9001 GAAAGGCAAGATCCTTTAAACCCAGATCAAGACTCTTCCAGTCATTACCACGGCACCAGC 9060 

3001 ERQDPLNPD QDSSSHYHGTS 3020 

9061 AGTGGCTCTGTGGCGGCTGCCATTCTGGTTCCTTTCTTTGCTCTAATTTTATCAGGGTTT 9120 

3021 S G SVAAA I LV P F FAL I LSGF 3040 
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GCATTTTACCTCTACAAACACAGAACGAGACCAAAAGTTCAATACAATGGCTATGCTGGG 9180 
AFYLYKHRTRPKVQYNGYAG 3060 



9181 CATGAAAACAGCAATGGACAAGCATCGTTTGAAAACCCCATC^ 9240 

3061 HENSNGQASFENPMYDTNLK 3080 

9241 CCCACAGAAGCCAAGGCTGTGAGGTTTGACACAACTCTGAACACAGTCTGTACAGTGGTA 9300 

3081 PTEAKAVRFDTTLNTVCTVV 3100 



9301 TAGCCCTCAGTGCCCCAACAGGACTGATTCATAGCCATACCTCTGATGGACAAGCAGTGA 9360 

3101 * 3101 

9361 TTCCTTTGGTGCCATATACCACTCTCCCYTCCACTCTGGCTTTACTGCAGCGATCTTCAA 9420 

942 1 CCTTGTCTACTGGCATAAGTGCAGCGGGGATCTCTACTCAAATGTGTCAGGGTCTTCTAC 9480 

94 81 GGATCAAACTACACATGCGTTTTCATTCCAAAAGTGGGTTCTAAATGCCra 9540 

9541 GTATGAAATCAAGGCACACTCCAGGAAGACTGCCACGTCGCGC 9600 

9601 TRCCTCAGACTTTCATATTTCTGTGTTGCT 9660 

9661 CGTGGATATGTCCCTCAGGTGCGGTGACAGAATGGTGGCACCACGATATGTGTTCTCTTG 9720 

9721 TGTTGTTTTTCCTTTTTAAACCCCCATGAACACGAAT^ 9780 

9781 TCTGGAAGAAGACACCTTTCTGATAGAGGCTCACACCTACAAATGCTTCACTCTGTCCTT 9840 

9841 CCGAGACCTGACAAGCTTTGAGGACCTCACAGCTCCCCTGTGTGTTCATCTCTAGGGATG 9900 

9 90 1 TTTGCAATTTCCCAGTGAGCTGTTCTGTCGCAGAATGTTTA 9960 

9961 AGTGTGTTATGAATGACTAAGATTCTGATAAAAAAAATAAATTATTTACACAGGGTTTAT 10020 

10021 ACACACTATCCATTGTATATAAGCATTATTTCATATTATCAAGCTAT^ACATTCCCCCATC 10080 

10081 AGCTTAGTTGGAGTGTTAGGGAAAAGTATTCCTAGATATGGCACAGATTTTA 10140 

10141 TACAGTATTGACGAGATTTATTTTATTATTGCTTCAATTAGCTCCATTTACGTGTTG^^ 10200 

10201 TCATTGAAGAGGTCCAATGAGAAAAAAACAGAAGCCTCCTTATTTCACACGTTTTCCTCC 10260 

10261 TTTAGTACCATCCTCATCCAATTACTGTCTCTCTGATACTACTTAATAGCAGGGGGTTTG 10320 

10321 CAGAAATTTCTGTTTGCCATGTAAAACTGTGAATAGTAATTTATTTTAGATAGTCGATGA 10380 

10381 ACTTGTGGGTTTTAGCTCACAATGCAGCCTTCCCT^ 10433 
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Map of Rat C3b/C4b Complement Receptor like cDNA (SEQ ID NO; 3) and 

Amino Acid Sequences (SEQ ID NO: 4) 

1 GATGCCGGGAAGGTGGGGGACACCAGATCCGTCTTGTACGTGCTTACAGGCTCCAGTGTC 6 0 

1DAGKVGDTRSVLYVLTGSSV 20 

6 1 CCTGACCTCATCGTGAGCATGAGCAATCAGATGTGGCTCCACCTGCAGTCAGACGACAGC 120 

2IPDLIVSMSNQMWLHI1QSDDS 40 

121 ATTGGTTCCCCAGGATTTAAAGCTGTGTACCAAGAAATCGAGAAGGGAGGCTGCGGGGAC 180 

41 I G S PGF KAVYQE I E KGGC GD 60 

181 CCTGGCATCCCAGCCTACGGGAAGCGGACTGGCAGCAGCTTCTTGCACGGGGACACGCTC 240 

61PGI PAYGKRTGSSFLH GDTL 80 

241 ACCTTTGAGTGCCAGGCAGCTTTTGAGCTGGTAGGAGAGAGAGTGATTACGTGCCAGAGA 300 

81TFEC QAAF ELVGERVITCQR 100 

301 AACAACCAGTGGTCCGGCAACAAGCCAAGCTGTGTGTTTTCATGTTTCTTCAACTTCACG 360 

101 NNQWSGNKPSCVFSCFFNFT 120 

361 GCGTCCTCTGGGATCATCCTGTCGCCAAACTATCCTGAGGAATATGGCAACAACATGAAT 420 

121 AS SGI IL SPNYPEEYGNNMN 140 

421 TGTGTGTGGTTGATTATATCTGAGCCCGGGAGCCGGATTCACCTCATCTTCAATGATTTC 480 

141 CVWLII SEPGSRIHLIFNDF 160 

481 GATGTGGAGCCTC^GTTTGACTTCCTTGCGGTCAAAGATGATGGGATTTCTGACATCACA 540 

161 DVEPQFD FLAVKDD G I SD IT 180 

541 GTCCTCGGGACTTTCTCTGGCAATGAGGTGCCTGCACAGCTGGCC . GCAGTGGACACATA 600 

181 VIiGTFS GNEVPAQLAXSGHI 200 

601 GTACGCCTGGAGTTTCAGTCCGATCACTCTACCACGGGCAGAGGGTTCAACATCATATAC 660 

201 VRLEFQSDH STTGRGFNI IY 220 

661 ACCACATTTGGTCAGAACGAGTGTCATGACCCTGGGATCCCTGTGAATGGACGGCGCTTT 720 

221 TT FGQNECHDPGI PVNGRRF 240 

721 GGAGACAGGTTTCTGCTGGGAAGTTCTGTGTCCTTCCACTGTGATGATGGCTTTGTGAAG 780 

241 GDRFLLGSSVSFHCDDGFVK 260 

781 ACTCAGGGTTCTGAGTCTATCACATGCATCTTGCAAGATGGAAACGTGGTCTGGAGCTCT 84 0 

261 TQGSESITCILQDGNVVWSS 280 

841 ACTGTCCCTCGCTGTGAAGCTCCTTGTGGTGGGCATCTGACAGCTTCTAGTGGGGTCATA 900 

281 TVPRCEAPC GGHLTASSGVI 300 

901 TTACCTCCAGGATGGCCAGGATATTACAT^AGATTCTTTAAATTGCGAATGGGTCATTGAA 960 

301 LPPGWPGYYKDSLNCEWVIE 320 

961 GCCAAACCAGGACATTCCATCAAAATAACATTTGAC^GGTTCCAGACAGAAGTC^TTAT 102 0 

321 AKPGHS I KI TFDRFQTEVNY 340 

1021 GATACTCTGGAAGTCCGGGATGGGCCAACCAGCTCATCCCCACTGATTGGGGAGTACCAT 1080 

341 DTLEVRDGPTSSSPLIGEYH 360 
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108} GGCACCCAGGCTCCACAGTTCCTCATCAGCACAGGGAACTACATGTACCTGCTGTTTACC 1140 

361 GTQAPQFLiI STGNYMYkLFT 380 

1141 ACTGACAGCAGCCGCGCTAGTGTTGGCTTCCTCATCCACTATGAGAGTGTGACTCTTGAA 1200 

381 TDSSRASVGFLIHYESVTLE 400 

1201 TCTGACTCCTGTCTGGACCCGGGCATCCCTGTAAATGGTCATCGGCATGGCAGTAACTOT 1260 

401 SDSCLDPGI PVNGHRHGS. NF 420 

1261 GGTATCAGATCTACAGTGACCTTCAGCTGTGACCCTGGGTACACGCTCAGTGATGACGAT 1320 

42XGXRSTVTFSCDPGYTX>SDDD 440 

1321 CCCCTCATCTGTGAGAAGAACCATCAGTGGAACCACGCCTTGCCCAGCTGTGATGCCCTG 1380 

441 PLICEKNHQWNHALPSCDAL 460 

1381 TGTGGAGGCTACATCGATGGAAAGAGTGGGACTGTTCTTT 144 0 

461 CGGYI HGKSGTVIjSPGFPDF 480 

1441 TATCCAAACTCTCTGAACTGTACATGGACCATTGAAGTCTCTCATGGCT^AGGGAGTGCAG 1500 

481 YPNSIiNCTWTIEVSHGKGVQ 500 

1501 ATGAATTTCCACACCTTTCACCTTGAAAGTTC 1560 

501 MNFHTFHLES SHDYLLITED 520 

1561 GGGAGTTTCTCAGAGCCGGTAGCCAGGCTCACTGGGTCGGTCCTGCCT(^CACCATTAAG 1620 

521 GSFSEPVARLTGSVIjPHTIK 540 

1621 GCTGGCTTGTTTGGAAACTTCACTGCGCAACTCAGGTTCATCTCTGACTTCTCCATCTCC 1680 

541 AG LFGNFTAQLRFISDFSIS 560 

1681 TATGAAGGCTTCAACATTACGTTTGCAGAATATGACCTAgAACCCTGTGATGACCCTGGA 1740 

561 YEGFNITFAEYDLEPCDDPG 580 

1741 GTCCCTXX^CTACAGTCGCAGAATTGGGTTCCAGTTCGGTGTGGGTGACACCCTGGCTTTC 1800 

581 VPAYS RRI GFQFGVGDTIiAF 600 

1801 ACCTGCTTCCAGGGATACCGCTTAGAAGGTGCAACCAAGCTTACCTGCCTGGGTGGGGGA 1860 

601 TCFQGYRLEGATKLTCIjGGG 620 

1861 CGCCGAGTGTGGAGTGCACCTCTGCC^GGTGTGTGGCTGAATGTGGAGCAAGC 192 0 

621 RRVWSAPL PRCVAECGASVK 640 

1921 GGAAATGAAGGAACATTACTCTCTCCTU^TTTCCCATCCAATTATGATAATAACCATGAG 1980 

641 GNEGT IiLS PNFPSNYDNNHE 660 

1981 TGTATCTATAAAATAGAAACAGAAGCCGGAAAGGGGATCCATCTCAGAGCCCX3AACCTTC 2040 

661 CIYKI E TEAGKGIHIjRARTF 680 

2041 CAACTCTTCGAAGGAGA(^CTCTAAAGGTTTATGATGGAAAGGACAGCTCCTCGAGGTCA 2100 

681 QLFEGDTLKVYDGKDSSSRS 700 

2101 CTGGGAGTCTTCACAAGAAGTGAACTGATGGGGC^ 2160 

701 LGV FTR S E LMGLVLN S TS NH 720 

2161 CTGAGGCTGGAGTTCAACTCTAACGGGTCAGATACCGCCC^GGCTTCCAGCTCACCTAC 2220 

721 LRIiE FN SN GSDTAQ GFQLTY 740 
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2221 ACCAGTTTTGACCTAGTGAAATGTGAGGATCCAGGCATCCCTAACTATGGCTACAGGATC 2280 

741 TSFDLVKCEDPGI PNYGYRI 760 

2281 CGAGATGATGGTCACTTCACAGACACTGTGGTTCTCTACAGCTGCAACCCAGGCTACGCA 2340 

761 RDDGHFTDTVVIiYSCNPGYA 780 

2341 ATGCATGGCAGCAGTACCC t GACC t GCCTGAGTGGGGACCGAAGGGTGTGGGACAAACCT 2400 

781 MHGS STLT CLSGDRRVWDKP 800 

24 01 ATGCCTTCCTGTGTGGCGGAATGTGGTGGTCTCGTCCATGCAGCCACATCAGGACGCATA 2460 

801 MPS CVAECGGLVHAATSGRI 820 

2461 CTCTCTCCTGGCTACCCTGCCCCATATGACAAC7VACCTTCATTGCACTTGGACCATAGAG 2520 

821 LSPGYPAPYDNNLHCTWTIE 840 

2521 GCTGATCCTGGCAAGACCAYCAGCCTCCATTTCATTGTGTTTGACACTGAAACGGCGCAC 2580 

841 ADPGKTXSLHFIVFDTETAH 860 

2581 GACATCCTC/LAGGTCTGGGATGGTCCAGTGGACAGCAACATCCTGCTGAAGGAGTGGAGC 2640 

861 DILKV WDGPVDSNILLKEWS 880 

2641 GGCTCGGCCCTTCCTGAGGACATCCACAGCACCTTCAACTCGCTCACCCTGCAGTTCGAT 2700 

881 GSALPEDIHSTFNSLTLQFD 900 

2701 AGTGACTTCTTCATCAGCAAGTCCGGCTTCTCCATCCAGTTCTCTACTTCCATTGCATCC 2760 

901 SDFFISKSGFSIQFSTSIAS 920 

2761 ACCTGCAATGACCCTGGGATGCCTCAGAATGGAACCCGCTATGGTGACAGCCGGGAACCT 2820 

921 TCND PGMPQNGTRYGDSREP 940 

2821 GGAGACACCATCACCTTCCAGTGTGACCCTGGATACCAGCTCCAAGGGCAAGCCAAGATC 2880 

941 GDTI TFQCDPGYQLQGQAKI 960 

2881 ACTTGTGTGCAGCTTAACAACCGCTTCTTCTGGCAACCAGACCCTCCGTCATGCATAGCT 2940 

961 TCVQ LNNRF F WQPD P P S C IA 980 

2941 GCTTGTGGTGGGAATCTGACAGGCCCTGCTGGAGTGATTTTATCCCCAAACTACCCACAG 3000 

981 ACGGNLTGPAGVI LS PNYPQ 1000 

3001 CCATACCCTCCTGGGAAGGAGTGTGACTGGAGAATTAAGGTGAACCCAGACTTTGTCATT 3060 

1001 PYPPGKECDWRIKVNPDFVI 1020 

3061 GCCTTAATATTCAAAAGTTTTAGCATGGAGCCAAGTTACGACTTCCTGCATATCTATGAA 3120 

1021 A L I FKSFSMEPSYDFIiHIYE 1040 

3121 GGGAAGGACTCCAACAGCCCACTGATCGGAAGCTTCCAGGGTTCTCAAGCCCCAGAGAGG 3180 

1041 GKDSNS PLIGSFQGSQAPER 1060 

3181 ATTGAGAGCAGTGGTAACAGCCTCTTCCTGGCATTCAGGAGTGATGCCTCTGTTGGCCTG 3240 

1061 IESSGNSLFLAFRSDASVGL 1080 

3241 TCCGGGTTTGCCATTGAATTTAAAGAGAAACCACGGGAAGCTTGCTTTGACCCTGGGAAC 3300 

1081 SGFAI EFKEKPREACFDPGN 1100 

3301 ATAATGAACGGGACAAGGATTGGAACGGACTTTAAGCTGGGCTCTACAGTTACCTATCAA 3360 

1101 IMNGTR I GTDFKLGSTVTYQ 1120 
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3361 TGTGACTCTGGTTACAAGATTGTGG 3420 

1121 CDSGYKIVDPSSIECVTGAD 1140 

3421 GGGAAGCCGTCCTGGGACCGGGCACTGCCTGCCTGCCAAGCACCCTGTGGAGGCCAATAC 3480 

1141 GKPSWDRALPACQAPCGGQY 1160 

3481 ATGGGCTCGGAGGGGGTAGTTTTGTCACCAAACTACCCTCATAACTACACGGCTGGGCAG 3540 

1161 MGS EGVVLS PNY PHNYTAGQ 1180 

3541 ATATGCATCTATTCCATCACGGTGCCCAAGGAATT^ 3600 

j. _L \-> V. J- W-» A * • *. •> - ■ » * »— *■ ** *> ^ *» 

3601 TTCCAGACTGCGCTGAACGACTTGGCAGAATTGTTTGATGGAACCCATCCTCAGGCCA^ 3660 

1201 FQTALN DIiABLFDGTHPQAR 1220 

3661 * CTTCTCAGTTCTCTCTCTGGTTCCCATTCAGGTGAAACACTCCC 3720 

1221 I, IiSSL.SG SHSGETIiPIj .ATSN 1240 

3721 CAGATTCTGCTTCGCTTGAGCGCAAAGAG 3780 

1241 QILIiRFSAKSGASARGFHFV 1260 

3781 TACCAAGCCGTCCCACGCACCAGTGACACGCAGTGCAGCTCCGTCCCTGAGCCCAGATAT 3840 

1261 YQAVPRTSDTQCSSVPEPRY 1280 

3841 GGGAGAAGGATTGGTTCTGAGTTCTCTGCAGGCTCCATCGTCCGATTCGAGTGCAACCCA 3900 

1281 GRRIGSEFSA GS IVRFECNP 1300 

3901 GGTTACCTGCTGGAAGGCTCCACAGCCATCCGTTGTCAGT 3960 

1301 GYLLQGSTAIRCQSVP NALA 1320 

3961 CAGTGGAATGACACCATCCCAAGCTGTGTAGTTCCATGCAGTGGCAATTTCACTCAGAGA 4020 

1321 QWNDTIPSCVVPCSGNFTQR 1340 

4021 AGAGGGACAATCTTATCTCCAGGCTACCCTGAGCCCTATGGGAACAACCTGAACTGTGTA 4080 

1341 RGTI LiSPGYPEPYGNNLNCV 1360 

4081 TGGAAGATCATAGTATCGGAGGGCTCAGGGATCCAGATCCAAGTGATTAGCTTTGCCACG 4140 

1361 WKI IVSEGSGIQIQ VISFAT 1380 

4141 GAGCAGAACTGGGACTCCCTGGAGATCCATGACGGAGGAGACATGACGGCCCCCAGACTG 4200 

1381 EQNWDSL EIHDGGD MTAPRL 1400 

4201 GGCAGCTTCTCAGGTACC^CAGTGCCCGCACTGCTGAATAGCACCTCCAACCAGCTCTGC * 4260 

1401 GS F S G TTVPALLN S T SNQLC 1420 

4261 CTGCACTTCCAGTCGGAGATCAGTGTTGCCGCT 4320 

1421 LHFQSDI SVAAAGFH LEY KT 1440 

4321 GTGGGTCTGGCTGCGTGCCAGGAACCTGCTCTCCCGAGCAACGGCATCAAGATAGGAGAC 4380 

1441 VGIiAACQEPALPSNG I KIGD 1460 

4381 CGCTATATGGTGAACGATGTGCTGTCCTTCCAGTGCGAGCCTGGGTACACCTTGCAGGGC 4440 

1461 RYMVNDVLS FQCEPGYTIiQG 1480 

4441 CGCTCACACATTTCTTGTATGCCGGGAACTC^ 4500 

1481 RSHISCMPGTVRRWNYPSPL 1500 
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4501 TGCATTGCCACCTGTGGTGGGACACTGACCAGCATGAGTGGAGTGATCCTGAGCCCAGGC 4560 

1501 CIATCGGTLTSMSGVILSPG 1520 

4561 TTCCCAGGGTCATACCCCAACAACCTGGACTGCACCTGGAAGATATCCCTGCCCATTGGC 4620 

1521 FPG SYPN NLDCTWKI SLPIG 1540 

4621 TATGGTGCACATATCCAATTTCTGAATTTCTCAACTGAAGCCAACCATGACTACCTGGAG 4680 

1541 YGA H I Q FLiNF STEANH. DYLE 1560 

4681 ATCCAGAATGGCCCTTACCACAGTAGTCCAATGATGGGACAGTTCAGTGGCCCTGACCTG 4740 

1561 IQNGPYHSSPMMGQ FS GPDL 1580 

4741 CCTGCGTCACTGCTGAGCACCACACATGAAACCCTCATCCGCTTCTATAGTGACCACTCA 4800 

1581 PAS LliSTT HETLI RFYSDHS 1600 

4 801 CAGAACCGACAAGGATTTAAACTCAGTTACCAAGCTTATGAGTTACAGAACTGCCCGGAC 4860 

1601 Q NR QGFKIiSYQAYEIjQNCPD 1620 

4861 CCACCCGCATTCCAGAATGGGTTCATGATCAACTCCGATTACAGCGTGGGCCAGTCGATC 4920 

1621 PPA FQNGFMINSDYSVGQSI 1640 

4921 TCATTTGAGTGCTACCCGGGCTACATCTTGCTAGGCCACCCTGTGCTCACCTGCCAGCAT 4980 

1641 SFE CYPGYILLGHPViiTCQH 1660 

4981 GGCACTG AC AGGAACTGGAACTACCCTTTCCCACGGTGTGACGCTCCCTGTGGGTATAAT 5040 

1661 GTDRNWNYPFPRCDAPCGYN 1680 

5041 GTG ACATCACAGAATGGCACCATTTATTCCCCTGGGTTCCCAGACGAGTATCCAATTCTG 5100 

1681 VTS QNGT I Y S P'G F PDE YP I L 1700 

5101 AAGGACTGCCTGTGGCTGGTCACTGTCCCTCCAGGACATGGAGTGTACATCAACTTCACC 5160 

1701 KDCLWLVTVPPGHGVYINFT 1720 

5161 TTGCTGC AGACTGAGGCTGTAAATGACTACATCGCTGTGTGGGATGGTCCTGACCAGAAC 5220 

1721 LLQ TEAVNDYIAVWD GPD QN 1740 

5221 TCGCCTCAGCTCGGGGTCTTCAGTGGA^CACTGCCCTCGAGACAGCATACAGCTCCACC 5280 

1741 SPQ LGVFSGNTALiETAYSST 1760 

5281 AACCAGGTCTTGCTCAAATTCC^CAGCGATTTCTCCAATGGAGGCTTCTTTGTCCTCAAT 5340 

1761 NQVLLKFHSDFSNGGFFVLN 1780 

5341 TTTCATGCATTTCAACTGAAGAGGTGCCCGCCTCCTCCAGTAGTGCCGCAGGCTGACCTG 54 00 

1781 F HA F 'QIiKRCPPPPVVPQADL 1800 

5401 CTTACAGAAGATGT^GACTTTGAAATAGGGGACTTCGTAAAGTACCAGTGCCATCCAGGG 5460 

1801 LTEDEDFEIGDFVKYQCHPG 1820 

5461 TAC^CGCTGTTGGGAAGTGACACCCTGAC^TGCAAGCTC^GCTCAC^GCTATTGTTCCAA 5520 

1821 YTLLGSDTIiTC'KL, SSQLLFQ 1840 

5521 GGCTCTCCACCTACCTGTGAAGCACAATGCCCAGCCAATGAAGTGCGAACAGAGTCTTCT 5580 

1841 GSPPTCEAQCPANEVRTES S 1860 

5581 GGGGTGATTCTCAGTCCTGGGTACCCAGGCAACTATTTTAACTCCCAGACATGTGCTTGG 5640 

1861 GVI LSPGYPGNYFNSQTCAW 1880 
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5 64 1 AGTATTAAAGTGGAGCCAAACTTTAACATTACGCTCTTTGTGGACACCT 5700 

1881 S I KVEPNFNI TLFVDTFQSE 1900 

5701 AAGCAATTTGATGC^CTGGAAGTATTTGATGGTTCTTC 5760 

1901 KQFDALEVFDGS SGQS PI>LV 1920 

5761 GTCTTAAGTGGGAACCACACTGAACAGTCCAATTTTACCAGCAGAAGTAACCATCTGTAC 5820 

1921 VIiSGNHTEQSN FTSRSNHLY 1940 

5821 CTCCGCTGGTCCACAGATCATGCAACCAGCAAGAAAGGATTCAAGATTCGCTATGCAGCT 5880 

1941 Ij R. W S T D K A "T S K K. G F K 1 R Y A A I960 

5881 CCTTACTGCAGCCTCACCTCTACACTC 5940 

1961 PYCSIiTSTLKNGGVIiNKTAG 1980 

5941 GCCCTGGGGAGCAAGGTGCAGTATTTCTGCAAGCCTGGATATCGAATGATTGGCCACAGC 6000 

1981 ALGSKVQYFCKPGYRMIGHS 2000 

6001 AACGCCACCTGCAGGCGGAACCCAGTGGGCGTGTACCAGTGGGACTCGATGGCACCGCTT 6060 

2001 NAT CRRNPVGVYQWD SMAPL 2020 

6061 TGCCAGGCTGTGTCCTGTGGAATTCCAGAGGCTCCAGGAAATGGCTCGTTCACAGGCAAT 6120 

2021 CQAV-SCGI PEAPGNGSFTGN 2040 

6121 GAGTTCACCTTAGACAGTAAAGTGACTTATGAATGTAATGAAGGCTTCAAGCTGGATGCC 6180 

2041 E FTLDSKVTYECNEG FKliDA 2060 

6181 AGTCAGCAAGCCACTGCTGTGTGTCAAGAAGATGGCCTGTGGAGCAACAGAGGAAAGCCA 6240 

2061 SQ QATAVCQEDGLWSNRGKP 2080 

6241 CCCACGTGCAAACCGGTGCCCTGCCCCAGCATCGAAGGCCAGCTGTCAGAGCACGTGCTC 6300 

2081 PTCKPVPCPSIEGQLSEHVL 2100 

6301 TGGAGGCTGGTTTCGGGATCATTGAATGAATATGGAGCTCAAGTTCTCCTCAGCTGTAGT 6360 

2101 WRLVSGSIiNEYGAQVbLSCS 2120 

6361 CCTGGCTACTTCTTGCAGGGTCAGAGGCTGTTGCAGTGCCAAGCCAATGGGACCTGGAAC 6420 

2121 PGY FLQGQRLliQCQANGTWN 2140 

6421 ACTGAGGAGGACAGACCCAGATGTAAAGTCATCTCCTGTGGAAGCCTGTCCTTTCCCCC^ 6480 

2141 TEEDRPRCKVI SCGSLSFPP 2160 

6481 AATGGTAACAAGATAGGGACX3CTCACTATGTATGGAGC CACCGCCATCTTTAC CTGCAAT 6540 

2161 NGNKIGTLTMYGATAI FTCN 2180 

6541 ACCGGCTACACACTTGTAGGCTCCCATGTCCGGGAGTGCTTGGCCAATGGTCTCTGGAGC 6600 

2181 TGYTLVGSHVRECLANGIiWS 2200 

6601 GGATCTGAAACAAGGTGCCTGGCGGGTCATTGTGGCTCTCCAGACCCCATTGTGAATGGC 6660 

2201 GSETRCLAGHCGSPDPIVNG 2220 

6661 CATATCAGTGGCGATGGCTTGAGCTACAGGGACACAGTC^ 6720 

2221 HISGDGFSYRDTVVYQCNPG 2240 

6721 TTTCGACTCGTAGGCACGTCTGTGAGGATTTGCCT 6780 

2241 FRLVGTSVRI CL.Q DHKWSGQ 2260 
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6 7 8 J. ACCCCCGTTTGCGTCCCCATCACATGTGGACACCCTGGAAACCCTGCCCATGGCCTCACC 6840 

2261 TPV.C VPITCGHPGNPAHGLT 2280 

6841 AACGGCAGCGAGTTCAACCTGAATGACCTTGTGAATTTCACCTGCCATACGGGCTACCTG 6900 

2281 NGSEFNLNDL.VNFTCHTGYL 2300 

6 9 0 1 CTGCAGGGTGCCTCCCGAGCCCAATGTCGGAGCAACGGCCAGTGGAGCAGCCCCTTGCCT 6960 

2301 LQGASRAQCRSNGQWSSPLP 2320 

6961 ATCTGCCGAGTGGTGAACTGTTCCGATCCTGGATTTGTGGAAAATGCAGTTCGCCACGGG 7020 

2321 ICRVVNCSDPGFVENAVRHG 2340 

702^ CAACAGAACTTTCCAGAGAGTTTCGAGTATGGGACAAGTGTGATGTATCACTGCAAGAAG 7080 

2341 QQNFPESFEYGTSVMYHCKK 2360 

7081 GGGTTCTACCTACTGGGCTCTTCTGCCCTGACCTGCATGGCAAGTGGCTTGTGGGACCGC 7140 

2361 G FYLLGS SA L TC MAS GLWDR 2380 

7141 TCCTTACCCAAGTGTCTGGCTATATCATGTGGGCATCCTGGGGTCCCCGCTAATGCTGTC 7200 

2381 SLPKCLAI S CGH PGVPANAV 2400 

7201 CTGACTGGAGAATTGTTTACATTTGGAGCCACAGTTCAGTACTCCTGCAAAGGGGGCCAG 7260 

2401 LTGEL FTFGATVQYSCKGGQ 2420 

7261 ATTCTCACAGGCAATAGCACAAGAGTCTGCCAAGAAGACAGTCACTGGAGTGGATCCCTT 7320 

2421 I LTGNSTRVCQEDSHWSGSli 2440 

7321 CCCCATTGTTCAGGAAATAGTCCTGGATTTTGTGGTGATCCAGGGACCCCAGCACATGGG 7380 

2441 PH CSGNSPG FCGDPGTPAHG 2460 

7381 TCTCGTCTTGGGGATGAGTTTAAGACAAAGAGTCTTTTGCGATTCTCCTGTGAGATGGGC 7440 

2461 SRLGDEFKT KSIiLRFSC EMG 2480 

7441 CACCAGCTGCGGGGTTCTGCAGAGCGCACATGCCTGGTGAATGGGTCCTGGTCAGGAGTC 7500 

2481 HQIjRGSAERTCLVNGSWSGV 2500 

7501 CAGCCTGTGTGTGAGGCCGTGTCCTGTGGAAACCCTGGCACCCCTACCAATGGGATGATC 7560 

2501 QPVCEAVSCGNPGTPTNGMI 2520 

7561 CTCAGCAGCGATGGAATCCTCTTCTCCAGCTCTGTCATCTATGCCTGCTGGGAAGGCTAC 7620 

2521 LSSDGIIiFSSSVIYA CWEGY 2540 

7621 AAGACCTCGGGGCTCATGACGCGGCACTGCACAGCGAACGGGACATGGACAGGCACAGCC 7680 

2541 KT SGLM TRH CTANGTWTGTA 2560 

7681 CCTGACTGTACAATC^TCAGCTGTGGTGATCCT 7740 

2561 PDCTI ISCGDPGTLPNGIQF 2580 

7741 GGGACAGACTTCACTTTCAACAAGACCGTGAGCTATCAGTGC^ACCCTGGCTACCTGATG 7800 

2581 GTDFTFNKTVSYQCNP GYLM 2600 

7801 GAGCCCCCAACATCACCCACC^TCCGCTGCACCAAAGATGGTACATGGAATCAGACCC^ 7860 

2601 E P PTS P TI RCTKDGTWNQTR 2620 

7861 CCCCTCTGCAAAGCTGTTCTATGCAGCCAGCCTCCCTCAGTGCCAAACGGAAAGGTGGAG 7920 

2621 PLCKAVLCSQPPSVPNGKVE 2640 
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7921 GGGTCAGACTTCCGATGGGGTGCCAGCATAAGCTACAGTTGTGTGGATGGCTACCAGCTC 7980 

2641 GSDFRWGAS I SYSCVDGYQL 2660 

7981 TCCCACTCGGCCATCCTGTCCTGTGAAGGGCGTGGAGTATGGAAAGGAGAAGTCCCTCAG 804 0 

2661 SHSA1 LiSCEGRGVW KGEV PQ 2680 

8041 TGCTTGCCTGTGTTCTGTGGCGATCCAGGCACTCCAGCAGAGGGACGGCTCAGTGGGAAA 8100 

2681 CLPVFCGDPGTPAEGRLSGK 2700 

8101 AGCTTC^CCTTTAAGTCTGAGGTCTTCATCCAGTGCAAACCCCCATTTGTGTT 8160 

2701 S FT F K S EV F I Q C KP P F V L V G 2720 

8161 TCCTCGAGGAGAACCTGCCAGGCCGATGGGATGTGGAGTGGCATCCAGCCCACTTGTATA 8220 

2721 SSRRTCQADG MWSGIQPTCI 2740 

8221 GATCCAGCCCACACCGCTTGCCCAGACCCCGGC^CTCCCC^CTTTGGAATACAGAATAGC 8280 

2741 DPAHTAC PDPGTPHFG .IQNS 2760 

8281 TCGAAAGGATACGAGGTTGGAAGCACTGTGTTCTTCAGATGTAGAAAAGGTTACCACATC 8340 

2761 S'KGYEVGSTVFFR-CRKGYHX 2780 

8341 CAAGGCTCCACTACCCGGACCTGTCTTGCGAAC^ 8400 

2781 QGS TTRTCIiANIiTWSGIQTE 2800 

8401 TGCATCCCCCATGCCTGCCGGCAGCCAGAGACCCCAGCGCATGCAGATGTGAGAGCCATC 8460 

2801 C I PHACRQPET PAHADVRAI 2820 

8461 GATCTTCCAGCTTTTGGCTAC7VCCTTAGTCTACACCTGTCATCCAGGATTTTTCCTTGCT 8520 

2821 DliPAFGYTLVYTCHPGFFIiA 2840 

8521 GGCGGATCTGAGCACAGGACGTGTAAAGCAGACATGAAATGGACAGGAAAGTCACCTGTT 8580 

2841 G GSE HRTCKADMKWTGKS PV 2860 

8581 TGTAAAAGTAAAGGAGTGAGAGAAGTTAATGAAAGAGTTACT^ 8640 

2861 CKS KGVREVNETVTKTPV PS 2880 

8641 GATGTATTTTTC ATCAACTCGGTGTGGAAGGGATATTATGAATATTTAGGCAAGAGACAG 8700 

2881 DVFFINSVW KGYYEYLGKRQ 2900 

8701 CCGGCGACTCTCACTGTGGACTGGTTTAATGCAACCAG 8760 

2901 PATLTVDWFNATSSKVNATF 2920 

8761 ACCGCAGCCTCACAGGTGCAGCTGGAGCTGACAGGGGTCTACAAGAAG 8820 

2921 TAASQVQLELTGVYKKEEAH 2940 

8821 CTGCTTCTGAAAGCCTTTCATATGAAAGGCCC^ 8880 

2941 LIiLKAFHI KGPADI FVSKFE 2960 

8881 AATGACAACTGGGGACTCGATGGTTATGTATCCTC^GGACTTGAGAGAGGAGGATTCT 8940 

2961 NDNWGLDGYVSSGLE.RGGFS 2980 

8941 TTTCAGGGTGATATACATGGAAAAGACTTCGG 9000 

2981 FQGDIHGKDFGKFKLERQDP 3000 

9001 TCCAACTCTGATGCAGATTCTTCAAATCATTACCAGGGCACCAGCAGTGGCTCTGTGGCA 9060 

3001 SNSDADSSNHYQGTSSGSVA 3020 
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„ 906?. GCTGCGATTCTCGTCCCCTTCTTCGCTCTAATTCTATCAGGGTTTGCATTTTACCTCTAC 9120 

3021 A A I LVPFFALILSGFAFYLY 3040 

• 9121 AAACACAGAACAAGACCAAAAGTTCAATACAATGGCTATGCTGGCCATGAAAACAGTAAT 9180 

3041 KHRTRPKVQYNGYAGHENSN 3060 

91 8 1 GGACAAGCTTCATTTGAAAACCCCATGTATGATACAAACTTAA7LACCCACAGAGGCCAAG 924 0 

'\ 3061 GQASFENP .MYDTNLKPTEAK 3080 

9241 GCTGTGAGGTTTGACACGACTCTGAACACAGTGTGTACAGTGGTATAGCCCTCAGTGCCC 9300 

3081 AVRFDTTLNTVCTVV* 3096 

9301 CCTAGGACCGACTCATAGCCATACCTCTGATGGACAAGCAGTAAAATCCTTTGGTGCCAT 9360 

9361 ATACCACCCCCTTCTACTCTTACCTTGCTGCAGCAACGTTGGCCATCGTCTGCTGGCATA 9420 

9421 ACGCAGTGGGAATGTCTTCTCCATCATGCCCGAGTCTTCTGAGGATCAAATTGCAAATAC 94 80 

948 1 ACCTTCATCTGGAAAGTGGCTTATAAAAAGCCCGGTTGCTGCATCCACCAGAAATCAAGA 954 0 

9541 CCCCGACAACAGCGAGGGCAAGGAAGACTGCAGAGTCTCCCAGACCGGTGGTACTTAATG 9600 

9601 CCTCTGACTTTTGTGTCTCTGTGTGGCCAGGATGCCTTTGGTGTAGTCTTCTGAGCACAC 9660 

*■ 9661 CGATACATCCCTCAGGTGCGGCGAGAACATGGTAGCCACTTGATGTGTGTTTTGTGTTTT 9720 

9721 TCTGTTTTCTTTTCAAC CCTATCCACTGGACATGAATTCTTTACAAAAGAAAAGCCTTCC 9780 

9781 TGGAGAAGACGCCTTCTGGAAAATGCACACACAGACGCTTTGCTTCTGCCCTGCCTGAGA 9840 

9841 CAGGAGCTCTCCGGATCTTCAGGCTCCACTGGGCGTCCATCAGCCACTAGGGATGTTTGC 9900 

9901 AGATCTCACAGTCAGAGCTGGTCCATCCCAGAGTTTTTTGATGCTCAACATTTTGCACTA 9960 

9961 GTGTGTCAAGAATGACTAAGTCTGATTTCTAAACAAACTATTTCGACAGGGTTTGTATCC 10020 

10021 ACTATACATTGTACATACGCATTTTCTCATACCGTATTCTCAAGCAAATGATGCCACTGT 10080 

10081 CAGTTAAGTTTGGGATGCAAAGGAAGGTCTCCCCGGATACAGAACAGATTTTGAAAAGGA 10140 

10141 GATAGTGCTAGTAATGCTGAAGAAGTTACTCTTTTAATTGCTTCTGTTGGCCACATTTTC 10200 

10201 ATGTCAAATTCATTGCCTACTTCCAGTGGTGGAAATGAAGCCCGTGTATTCCCCTTGGTA 10260 

10261 TCCCCCCACTTCATGTGCATACGACTATTGTCTACACCATACTAATCAATAACAGGGGGC 10320 

10321 TCCAGCAATGTCTGTTTTCCATGTACAGATGTGAATAGTAATTTATTTTAGGTAGCTCAT 10380 

10381 GAACTCAGTTCAC AGTGAAGTCTTCCCTTCCGGATTGTTTCCTTTCTCTGTTTTGTAACA 10440 

10441 TCACCCTCCCAGAATGCATTGAGAGTCTATCTCAGAGCCACACCCAAGCTCAGAGGAATC 10500 

10501 GAAAGGGAAATCAAAGAAGTCCAAATCAGAATCGGAAGGGCAGGCACCGCTCGCACACCC 10560 

10561 TCATGATGATCTGTTTTATAGATTATTTGCCTTTCTGCAAAAAAAAAATCATTACAGTGA 10620 

10621 TTTTTGAAACATTAAAATTCCTTACTGATAGACTATCTATTGTGATATATATAAGATAGG 10680 

10681 TGGTATGGCCAACAGGGATAAAATAAACAGCCTAAAGACAAGGCAGGGCTAGAGAAATGT 10740 

10741 CTGTAAGAAATTTCAAAGAGAAGATCATGTTTATTTTTATTTATATTTGTTTGATAAAAG 10800 

10801 TATTTTTGGAAATATAATGCTTATTTTATTATTTGACGTTTCATGC7VCAGTCCACGTGGT 10860 

10861 AAAAATCCCCCCTTTGTACATCCCAGATTTGCACTGATACATGGGTCAGGATGTCATGCT 10920 

1 0921 GATGTTCTGTTTGCTGTGGTGACTAC^TTCATGCCTAGCTTTAAGACAGGTGGATCTGTC 10980 

10981 TATCTACATGATGTTTAAATGCAGGACTTCCCAGAGGACAGTGGGTAACGGAACATGGCT 11040 

11041 TGCTTGCGGCTTTGGAAGTTCAGCATTCTGAGCGTTCCAGAGGCCCGGCTGGGCTCCCTC 11100 

11101 CTTCTAGCCCACTGTTCTTGCAAGGGCTGTCTGTTGTGTGCCAGGGCTCCTGACTTCTTC 11160 

11161 TGCTGACACTCTGTCCACTGGGTTCCATATTCCAGGACTCCATGTCCTAGGAAAGAGTTT 11220 

11221 TGACATAGGTTCCTCCAGCCAAGCCGACACACATCCACGGGGTTCCTCTGGGCTCCACAG 11280 

11281 AGGTTCTTCATTGGCTCCCTGGGATAAATTCAGATGATGTCAGCAAGAGTGTGCTTCTAT 11340 

11341 ACCACACATTGAGCCAAAACAAAACAGAGAACGTCAGAAGGTCCACGGACCAGAGTGCGC 11400 

11401 AAGGGAGAACAGGGTTACTATATATATTAGATGTATATAAAAACACACACACAAACATAT 11460 

11461 ATATATTGTACATATCTAAGTTTGAGTCACTCAGACTAGGTGCAAAATGCTGACTTTGGA 11520 

11521 GTCTAAACTAACGTCTCTGTCCCCACATCCCTGGCCTCTTTCCTGGCCAGTTACATTAAG 11580 

11581 AAGACTTGACTTAGACAGGGCATACATACATGCAAGGAACCACATCATCAGACCAGTGTC 11640 

11641 GTTTTCCTTTGTGTGCAAACTGACCTACAGCTACCAGACTGCATCATGGTATTTTU^AACC 11700 

11701 AACATACAATATTGAGCGGCACTCTCAGTTGAGAGCCTAGCTCT^ATCCTTCCTAGGANNN 11760 

11761 NNNNNNNNNNl^^ 11820 

11821 NNNNNNNNCACCAGGTC 11880 

11881 CTCAGTGAGGTCTAGACTTTTCCCTGAAGCGCCCAGAGCACAGTGAGGAGTCACGCTCTA 11940 

11941 TGAATGACAGGTTATGTGCTTTGAAGCTGTTCAACTGTTGCTTGTCTTTGCCCATCTTGC 12000 
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12001 CTTC AGG C TAGCTGCAATAATTTTTTTCTTCTGTAAAATATTTTGTAAACAATAACAACA 12060 
12061 ACAACAAAAGCTATTATAAAAAGGGAGAAAAGAAAGCTGGCATTATGATCAGG 12120 
12121 TCCATTCTTGCTGCCCCCCCCCTCCTGTCTCCACC^^ 12180 
12181 CGGAAGACCTTTTTGTACAGAGATATATTTTTTATGAAGAATTTC 12240 
12241 TGCTGTAATTTTTTGATTAATGTAGGTAAATT^ 12300 
12301 AAACTGTAATTTTCCCCCATAATGTAACATTACCCTC^ 12360 
12361 CCTATTCGAACATGTATTAATATTAAGGCGGCCTGTTAAAATGAACAGTATCTTT 12420 
12421 TGTCAAAAAAAATTATAAAGAGAGTGTAACATAACC TGTGTAATGCCAC CTATCTTTAAA 12480 
12481 GCAAATCAGAGTTCTAATTAAATATTTAATTTTAGATTTCAAAAA 12525 
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Figure 4A 

Comparison of Human C3b/C4b Complement Receptor, "h-CR" (SEQ ID 
NO:5) and Human AGP-41773, "41773" (SEQ ID NO:2) 



h-CR 102 KSCRNPPDPVNGMVHVI KGIQFGSQI KYSCTKGYRLI GS S SATCI I SGDT 151 

-•I .|| || • : I'M II III II I 

41773 293 QNCPDPPPFQNGYM . INSDYSVGQSVSFECYPGYILIGHPVLTC . QHGIN 340 

152 VIWDNETPICDRIPCGLPPTITNGDFISTNRENFH. . . YGSWTYRCNPG 198 

I- I II III IN I • « « II 

341 RNWNYPFPRCD . APCGYNVTSQNGTIYSPGFPDEYPILKDCIWLITVPPG 389 

• • • • • 

199 SGGRKVFELVGEPSIYCTSNDDQVGIWSGP. . . APQCI I PNKCTPPNVEN 245 

I II- - -I : =1 II -II : I ,1 

390 HGVYINFTLLQTEAV . . . . .NDYIAVWDGPDQNSPQLGVFSGNTA. .LET 432 

a • • • • 

246 GILVSDNRSLFSLNEWEFRCQPGFVMKGPRRVKCQALNKWEPELPSCSR 295 

I |. I - .| II - II =1 
433 A . YS STNQVLLKFHS . . DF . SNGGFFV LNFHAFQL K 464 

296 VCQPPPDVLHAERTQRDKDNFSPGQEVFYSCEPGYDLRGAASMRC . . TPQ 343 

Mill! II I I • I I MINIM M • I 

465 KCQPPPAVPQAEMLTED.DDFEIGDFVKYQCHPGYTLVGTDILTCKLSSQ 513 
344 GDWSPAAPTCEVKS CDDFMGQLLNGR VLFPV 374 

111! •- I I • 

514 LQFEGSLPTCEAQCPANEVRTGSSGVI LS PGYPGNYFNSQTCS WS I KVEP 563 

375 NLQLGAKVD . FVCDEGFQIi KG S S AS YCVLAGME S 407 

I : II I I II III • 

564 NYNITIFVDTFQSEKQFDALEVFDGSSGQSPLIjWLSGNHTEQSNFTSRS 613 

• • • • • 

408 LWNSSVPVCEQIF CPSPPVI PNGRHTGKPLEVFPFGK 444 

I-. | I Ml : I 

614 NQLYIiRWSTDHATSKKGFKIRYAAPYCSLTHPLKNGGILNRTAGA. .VGS 661 

• • • • • 

445 AVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGNGVWSSPAPRCGILGHCQ 494 

MM : Ml I -II I I I I . I 

662 KVHYFCKP GYRMVGHSNATCRRNPLGMYQWDSLTPLCQAVS . CG 704 

495 APDHFLFAKLKTQTNASDFPIGTSLKYECRPEY. . •YGRPFSITCLDNLV 541 

| : . . : | : • . | | | : . . | : . . 

705 IPE. . . . S PGNGS FTGNEFTLDS KWYECHEGFKLE S SQQATAVCQEDGL 750 

• ♦ » • • 

542 WSS. . PKDVCKRKSCKTPPDPVNGMVHVITDI QVGSRINYSCTT 583 

II- II -I I III ' M-- II- 

751 WSNKGKPPTCKPVAC . . PSIEAQLSEHVIWRLVSGSLNEYGAQVLLSCSP 798 

• • • • • 

584 GHRLIGHSSAECILSGNAAHWS. . TKPPICQRIPCGLPPTIANGDFISTN 631 

hi I I -I h M hi II MM I 

799 GYYLEGWRLLRCQANGT. . . WNIGDERPSCRVISCGSLSFPPNGNKIGTL 845 
632 RENFHYGSWTYRCNPGSGGRKVFELVGEPS I YCTSNDDQVGI WSGPAPQ 681 

Ih MM MM I -I hill 

846 TV. . . YGATAI FTCNTG YTLVGSHVRECLAN . . . : GLWSGSETR 882 
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Figure 4B 

'682 CIXPNKCTPPNVENGILVSDNRSLFSLNEVVEFRCQPGFVMKGPRRVKCQ 731 

h • -1 ' II ' I II = I «.| III : I I 

883 CLAGHCGSPDPIVNGHISGDG. . . FSYRDTWYQCNPGFRLVGTSVRI CIi 929 

732 ALNKWEPEIiPSCSRV.CQPPPDVLHAERTQRDKDNFSPGQEVFYSCEPGY 780 

-II : I I : I I - I - h I -I II 

930 QDHKWSGQTPVCVPITCGHPGNPAHG . . . FTNGSEFNLNDVVNFTCNTGY 976 

781 DLRGAASMRCTPQGDWSPAAPTCEVKSCDD . . FM . GQLIiNGRVLFPVNIjQ 827 

! • ! . -I ! !! !!! ! -I! I- . -h !! - i 

977 UjQGVSRAQCRSNGQWS S PLPTCRWNCSDPGFVENAI RHGQQNFPESFE 1026 

828 IX^KVDFVCDEGFQLKGSSASYCSTLAGMESLWNSSVPVCEQIFCPSPPVI 877 

I « : I -II I I I I I I I I I > I I I II II 

1027 YGMSILYHCKKGFYLLGSSALTCMANG . . . LWDRSLPKCLAISCGHPGVP 1073 

878 PNGRHTGKPLEVFPFX3KTVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGN 927 

I II I - I :| l*M II I III I I I 

1074 ANAVLTG. . . ELFTYGAWHYSC RG . SESLIGNDTRVCQEDSH . . 1112 

928 GVWSSPAPRC. . . . . GILGHCQAPDHFLFAKLKTQTNASDFPIGTSLKYE 972 

II II II II -:| II I- 

1113 . .WSGALPHCTGNNPGFCGDPGTPAH. .GSRIi GDDFKTKSLLRFS 1153 

973 CR. . PE YYGR P FS I TCLDNL VWS S P KDVC KRKS CKTPPDPVNGMVHVI TD 1020 

I - II I I I I II II* II I I ||h 

1154 CEMGHQLRGSP.ERTCLLNGSWSGLQPVCEAVSCGNPGTPTNGMIVSSDG 1202 

1021 IQVGSRINYSCTTGHRLIGHSSAECILSGNAAHWSTKPPICQRIPCGLPP 1070 

I | : |.| |:: | . | .| |- II I I I I 

1203 I LFS SSVI YACWEGYKTSGLMTRHCTANGT . . . WTGTAPDCTI I SCGDPG 1249 

1071 TITVNGDFISTNRENFHYGSWTYRCNPGSGGRKVFEIiVGEPSIYCTSNDD 1120 

hill I • I : h h 1 1 1 1 III -Ml 

1250 TLANGIQFGT. . . DFTFNKTVSYQCNPG YVMEAVTSATIRCTKD. . 1290 

1121 QVGIWSGPAPQCIXPNKCTPPNVENGILVSDNRSLFSLNEWEFRCQPGF 1170 

I h I I II hi I • • I 8 5 I h 

1291 . . GRWNPSKPVCKAVLCPQPPPVQNGTVEGSD . . . FRWGSSISYSCMDGY 1335 

1171 VMKGPRRVKCQAI^KWEPELPSCSRV.CQPPPDVLHAERTQP.r-KDNFSPG 1219 

• h 1 I I I II I : II I • h 

1336 QLSHSAILSCEGRGVWKGEIPQCLPVFCGDPG. . IPAEGRLSGK. SFTYK 1382 

122 0 QEVFYSCEPGYDLRGAASMRCTPQGDWSPAAPTC . . . EVKSCDDFMGQLIj 1266 

III: h « I h I I M HI I I I 

1383 SEVFFQCKSPFILVGSSRRVCQADGTWSGIQPTCIDPAHNTCPD.PGTPH 1431 

1267 NGRVLFPVNLQLGAKVDFVCDEGFQLKGSSASY 1316 

I t.\. | | | - | « -Ih III: h I 

1432 FGIQNSSRGYEVGSTVFFRCRKGYHIQGSTTRTC.LANL. .TWSGIQTEC 1478 
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1317 EQI FCPSPPVI PNGRHTGKPLEVFPFGKAVNYTCDPHPDRGTSFDLIGES 1366 

II I : II • III II I I I I 

1479 I PHACRQPET • P . AHADVRAIDLPTFGYTLVYTC . . HP GFFLAGGS 1520 

1^67 TIR.CTSDPQGNGVWSSPAPRC 1387 

I I -i • I- -I I 

1521 EHRTCKADMK WTGKSPVC 1538 
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SEQUENCE LISTING 

<110> AMGEN, Inc. 

<120> C3B/C4B COMPLEMENT RECEPTOR-LIKE MOLECULES AND USES THEREOF 

<130> 01017/37498 
<140> 

<141> 2001-07-23 

<150> 09/728,787 
<141> 2000-11-28 

<150> US 60/222,504 
<151> 2000-08-02 

<160> 7 

<170> Patentln version 3.0 

<210> 1 

<211> 10673 . 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (334) (9540) 

<400> 1 

cctggggaag cctctcggtt ccaggaaaat gggatggttg attgccctaa attgattttt 60 

taaaagaaaa ttcacgaatt ggcagccata gaatagagta atttctgtaa agcaccagtg 120 

atagtgatgt ttgaatatta atataatgga ccagaggctg tacagtcttt gaaagagggt 180 
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cttgctacct atatatctag ggtttggctg tttaaagcag caagaccctc ctttcaggtg 240 

gaagtcgatg tacttgttgc cttacctaaa agctttgaca tttctctttc ttgcaggctc 300 

acgggatcca gtgttcctga cctcattgtg age atg age aac cag atg tgg eta 354 

Met Ser Asn Gin Met Trp Leu 
1 5 

cat ctg cag teg gat gat age att ggc tea cct ggg ttt aaa get gtt 4 02 

His Leu Gin Ser Asp Asp Ser lie Gly Ser Pro Gly Phe Lys Ala Val 
10 15 20 

tac caa gaa att gaa aag gga ggg tgt ggg gat cct gga ate ccc gee 450 
Tyr Gin Glu He Glu Lys Gly Gly Cys Gly Asp Pro Gly He Pro Ala 
25 30 35 

tat ggg aag egg acg ggc age agt ttc etc cat gga gat aca etc ace 498 
Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu His Gly Asp Thr Leu Thr 
40 45 50 55 

ttt gaa tgc ccg gcg gee ttt gag ctg gtg ggg gag aga gtt ate acc 546 
Phe Glu Cys Pro Ala Ala Phe Glu Leu Val Gly Glu Arg Val He Thr 
60 65 70 

tgt cag cag aac aat cag tgg tct ggc aac aag ccc age tgt gta ttt 594 
Cys Gin Gin Asn Asn Gin Trp Ser Gly Asn Lys Pro Ser Cys Val Phe 
75 80 85 

tea tgt ttc ttc aac ttt acg gca tea tct ggg att att ctg tea cca 642 
Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser Gly He He Leu Ser Pro 
90 95 100 

aat tat cca gag gaa tat ggg aac aac atg aac tgt gtc tgg ttg att 690 
Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met Asn Cys Val Trp Leu He 
105 110 115 

ate teg gag cca gga agt cga att cac eta ate ttt aat gat ttt gat 738 
He Ser Glu Pro Gly Ser Arg lie His Leu He Phe Asn Asp Phe Asp 
120 " 125 130 135 

gtt gag cct caa ttt gac ttt etc gcg gtc aag gat gat ggc att tct 786 
Val Glu Pro Gin Phe Asp Phe Leu Ala Val Lys Asp Asp Gly He Ser 
140 " 145 150 

gac ata act gtc ctg ggt act ttt tct ggc aat gaa gtg cct tec cag 834 
Asp He Thr Val Leu Gly Thr Phe Ser Gly Asn Glu Val Pro Ser Gin 
155 160 165 



ctg gee age agt ggg cat ata gtt cgc ttg gaa ttt cag tct gac cat 
Leu Ala Ser Ser Gly His He Val Arg Leu Glu Phe Gin Ser Asp His 
170 175 180 



882 



tec act act ggc aga ggg ttc aac ate act tac acc aca ttt ggt cag 930 
Ser Thr Thr Gly Arg Gly Phe Asn lie Thr Tyr Thr Thr Phe Gly Gin 
185 190 195 

aat gag tgc cat gat cct ggc att cct ata aac gga cga cgt ttt ggt 978 
Asn Glu Cys His Asp Pro Gly He Pro He Asn Gly Arg Arg Phe Gly 
200 205 210 215 

gac agg ttt eta etc ggg age teg gtt tct ttc cac tgt gat gat ggc 1026 
Asp Arg Phe Leu Leu Gly Ser Ser Val Ser Phe His Cys Asp Asp Gly 
220 225 230 
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ttt gtc aag acc cag gga tec gag tec att acc tgc at a ctg caa gac 1074 
Phe Val Lys Thr Gin Gly Ser Glu Ser lie Thr Cys lie Leu Gin Asp 
235 240 245 

ggg aac gtg gtc tgg age tec acc gtg ccc cgc tgt gaa get cca tgt 1122 
Gly Asn Val Val Trp Ser Ser Thr Val Pro Arg Cys Glu Ala Pro Cys 
250 255 260 

99 fc 99^ cat ctg aca gcg tec age gga gtc att ttg cct cct gga tgg 1170 
Gly Gly His Leu Thr Ala Ser Ser Gly Val lie Leu Pro Pro Gly Trp 
265 270 275 

cca gga tat tat aag gat tct tta cat tgt gaa tgg ata att gaa gca 1218 
Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys Glu Trp lie lie Glu Ala 
280 , 285 290 295 

aaa cca ggc cac tct ate aaa ata act ttt gac aga ttt cag aca gag 1266 
Lys Pro Gly His Ser lie Lys lie Thr Phe Asp Arg Phe Gin Thr Glu 
300 305 310 

gtc aat tat gac acc ttg gag gtc aga gat ggg cca gec agt teg tec 1314 
Val Asn Tyr Asp Thr Leu Glu Val Arg Asp Gly Pro Ala Ser Ser Ser 
315 320 325 

cca ctg ate ggc gag tac cac ggc acc cag gca ccc cag ttc etc ate 1362 
Pro Leu lie Gly Glu Tyr His Gly Thr Gin Ala Pro Gin Phe Leu lie 
330 335 340 

age acc ggg aac ttc atg tac ctg eta ttc acc act gac aac age cgc 1410 
Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe Thr Thr Asp Asn Ser Arg 
345 350 355 

tec age ate ggc ttc etc ate cac tat gag agt gtg acg ctt gag teg 1458 
Ser Ser lie Gly Phe Leu lie His Tyr Glu Ser Val Thr Leu Glu Ser 
360 365 370 375 

gat tec tgc ctg gac ccg ggc ate cct gtg aac grc cat cgc cac ggt 1506 
Asp Ser Cys Leu Asp Pro Gly lie Pro Val Asn Xaa His Arg His Gly 
380 385 ^ 390 

gga gac ttt ggc ate agg tec aca gtg act ttc age tgt gac ccg ggg 1554 
Gly Asp Phe Gly lie Arg Ser Thr Val Thr Phe Ser Cys Asp Pro Gly 
395 400 405 

tac aca eta agt gac gac gag ccc etc gtc tgt gag agg aac cac cag 1602 
Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val Cys Glu Arg Asn His Gin 
410 ^ " 415 420 

tgg aac cac gee ttg ccc age tgc gac get eta tgt gga ggc tac ate 1650 
Trp Asn His Ala Leu Pro Ser Cys Asp Ala Leu Cys Gly Gly Tyr lie 
425 430 435 

caa ggg aag agt gga aca gtc ctt tct cct ggg ttt cca gat ttt tat 1698 
Gin Gly Lys Ser Gly Thr Val Leu Ser Pro Gly Phe Pro Asp Phe Tyr 
440 445 450 455 

cca aac tct eta aac ygc acg tgg acc att gaa gtg tct cat ggg aaa 1746 
Pro Asn Ser Leu Asn Xaa Thr Trp Thr lie Glu Val Ser His Gly Lys 
460 465 470 

gga gtt caa atg ate ttt cac acc ttt cat ctt gag agt tec cac gac 1794 
Gly Val Gin Met lie Phe His Thr Phe His Leu Glu Ser Ser His Asp 
475 480 485 
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tat tta ctg ate aca gag gat gga agt ttt tec gag ccc gtt gee agg 1842 
Tyr Leu Leu lie Thr Glu Asp Gly Ser Phe Ser Glu Pro Val Ala Arg 
490 495 500 



etc ace ggg teg gtg ttg cct cat acg ate aag gca ggc ctg ttt gga 1890 
Leu Thr Gly Ser Val Leu Pro His Thr He Lys Ala Gly Leu Phe Gly 
505 510 515 

aac ttc act gec cag ctt egg ttt ata tea gac ttc tea att teg tac 1938 
Asn Phe Thr Ala Gin Leu Arg Phe He Ser Asp Phe Ser He Ser Tyr 
520 525 530 535 

gag ggc ttc aat ate aca ttt tea gaa tat gac ctg gag cca tgt gat 1986 
Glu Gly Phe Asn He Thr Phe Ser Glu Tyr Asp Leu Glu Pro Cys Asp 
540 545 550 

gat cct gga gtc cct gec ttc age cga aga att ggt ttt cac ttt ggt 2034 
Asp Pro Gly Val Pro Ala Phe Ser Arg Arg He Gly Phe His Phe Gly 
555 560 565 

gtg gga gac tct ctg acg ttt tec tgc ttc ctg gga tat cgt tta gaa 2082 
Val Gly Asp Ser Leu Thr Phe Ser Cys Phe Leu Gly Tyr Arg Leu Glu 
570 575 580 

ggt gec rcc aag ctt acc tgc ctg ggt ggg ggc cgc cgt gtg tgg agt 2130 
Gly Ala Xaa Lys Leu Thr Cys Leu Gly Gly Gly Arg Arg Val Trp Ser 
585 590 595 

gca cct ctg cca agg tgt gtg gee gaa tgt gga gca agt gtc aaa gga 2178 
Ala Pro Leu Pro Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys Gly 
600 605 610 615 

aat gaa gga aca tta ctg tct cca aat ttt cca tec aat tat gat aat 2226 
Asn Glu Gly Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr Asp Asn 
620 625 630 

aac cat gag tgt ate tat aaa ata gaa aca gaa gee ggc aag ggc ate 2274 
Asn His Glu Cys He Tyr Lys He Glu Thr Glu Ala Gly Lys Gly He 
635 640 645 

cac ctt aga aca cga age ttc cag ctg ttt gaa gga gat act eta aag 2322 
His Leu Arg Thr Arg Ser Phe Gin Leu Phe Glu Gly Asp Thr Leu Lys 
650 655 660 

gta tat gat gga aaa gac agt tec tea cgt cca ctg ggc acg ttc act 2370 
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe Thr 
665 670 675 

aaa aat gaa ctt ctg ggg ctg ate eta aac age aca tec aat cac ctr 2418 
Lys Asn Glu Leu Leu Gly Leu He Leu Asn Ser Thr Ser Asn His Xaa 
680 685 690 695 

tgg eta gag ttc aac acc aat gga tct gac acc gac caa ggt ttt caa 2466 
Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gin Gly Phe Gin 
700 705 710 

etc acc tat acc agt ttt gat ctg gta aaa tgt gag gat ccg ggc ate 2514 
Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys Cys Glu Asp Pro Gly He 
715 720 725 
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cct aac tac ggc tat agg ate cgt gat gaa ggc cac ttt acc gac act 2562 
Pro Asn Tyr Gly. Tyr Arg lie Arg Asp Glu Gly His Phe Thr Asp Thr 
730 735 740 

gta gtt ctg tac agt tgc aac ccg ggg tac gec atg cat ggc age aac 2610 
Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr Ala Met His Gly Ser Asn 
745 750 755 



acc ctg acc tgt ttg agt gga gac agg aga gtg tgg gac aaa cca eta 2658 
Thr Leu Thr Cys Leu Ser Gly Asp Arg Arg Val Trp Asp Lys Pro Leu 
760 . "* 765 770 * * 775 

cct teg tgc ata gcg gaa tgt ggt ggt cag ate cat gca gee aca tea 2706 
Pro Ser Cys lie Ala Glu Cys Gly Gly Gin lie His Ala Ala Thr Ser 
780 785 790 

gga cga ata ttg tec cct ggc tat cca get ccg tat gac aac aac etc 2754 
Gly Arg lie Leu Ser Pro Gly Tyr Pro Ala Pro Tyr Asp Asn Asn Leu 
795 800 805 

cac tgc acc tgg att ata gag gca gac cca gga aag acc att age etc 2802 
His Cys Thr Trp lie lie Glu Ala Asp Pro Gly Lys Thr lie Ser Leu 
810 815 820 

cat ttc att gtt ttc gac acg gag atg get cac gac ate etc aag gtc 2850 
His Phe lie Val Phe Asp Thr Glu Met Ala His Asp lie Leu Lys Val 
825 830 835 

tgg gac ggg ccg gtg gac agt gac ate ctg ctg aag gag tgg agt ggc 2898 
Trp Asp Gly Pro Val Asp Ser Asp lie Leu Leu Lys Glu Trp Ser Gly 
840 845 850 855 

tec gee ctt ccg gag gac ate cac age acc ttc aac tea etc acc ctg 2946 
Ser Ala Leu Pro Glu Asp He His Ser Thr Phe Asn Ser Leu Thr Leu 
860 865 870 

cag ttc gac age gac ttc ttc ate age aag tct ggc ttc tec ate cag 2994 
Gin Phe Asp Ser Asp Phe Phe He Ser Lys Ser Gly Phe Ser He Gin 
875 880 885 

ttc tec acc tea att gca gee acc tgt aac gat cca ggt atg ccc caa 3042 
Phe Ser Thr Ser He Ala Ala Thr Cys Asn Asp Pro Gly . Met Pro Gin 
890 895 900 

aat ggc acc cgc tat gga gac age aga gag get gga gac acc gtc aca 3090 
Asn Gly Thr Arg Tyr GLy Asp Ser Arg Glu Ala Gly Asp Thr Val Thr 
905 910 915 

ttc cag tgt gac cct ggc tat cag etc caa gga caa gee aaa ate acc 3138 
Phe Gin Cys Asp Pro Gly Tyr Gin Leu Gin Gly Gin Ala Lys He Thr 
920 925 930 935 

tgt gtg cag ctg aat aac egg ttc ttt tgg caa cca gac cct cct aca 3186 
Cys Val Gin Leu Asn Asn Arg Phe Phe Trp Gin Pro Asp Pro Pro Thr 
940 945 950 

tgc ata get get tgt gga ggg aat ctg acg ggc cca gca ggt gtt att 3234 
Cys He Ala Ala Cys Gly Gly Asn Leu Thr Gly Pro Ala Gly Val He 
955 960 965 
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ttg tea ccc aac tac cca cag ccg tat cct cct ggg aag gaa tgt gac 
L*6U Ser Pro Asn.Tyr Pro Gin Pro Tyr Pro Pro Gly Lys Glu Cys Asp 
970 975 980 



3282 



tgg aga gta aaa gtg aac ccg gac ttt gtc ate gec ttg ata ttc aaa 
Trp Arg Val Lys Val Asn Pro Asp Phe Val lie Ala Leu He Phe Lys 

985 990 995 

• »- . i 

acjt ttc aac atg gag ccc age tat gac ttc eta cac ate tat gaa 

Ser Phe Asn Met Glu Pro Ser Tyr Asp Phe Leu His He Tyr Glu 
1000 1005 1010 

ggg gaa gat tec aac age ccc etc att ggg agt tac cag ggc tct 

Gly Glu Asp Ser Asn Ser Pro Leu He Gly Ser Tyr Gin Gly Ser 
1015 * 1020 1025 



cag gee cca gaa aga ata 
Gin Ala Pro Glu Arg He 
1030 1035 



gag agt age gga aac age ctg ttt ctg 
Glu Ser Ser Gly Asn Ser Leu Phe Leu 
1040 



gca ttt egg agt gat gee tec gtg ggc ctt tea 
Ala Phe Arg Ser Asp Ala Ser Val Gly Leu Ser 
1045 1050 1055 



ggg ttc gec att 
Gly Phe Ala He 



gaa ttt aaa gag aaa cca egg gaa get tgt ttt gac cca gga aat 
Glu Phe Lys Glu Lys Pro Arg Glu Ala Cys Phe Asp Pro Gly Asn 
1060 "* 1065 1070 



3330 



3375 



3420 



3465 



3510 



3555 



ata atg aat ggg aca aga gtt gga aca gac ttc aag ctt ggc tec 

He Met Asn Gly Thr Arg Val Gly Thr Asp Phe Lys Leu Gly Ser 
1075 1080 1085 

ace ate acc tac cag tgt gac tct ggc tat aag att ctt gac ccc 

Thr He Thr Tyr Gin Cys Asp Ser Gly Tyr Lys He Leu Asp Pro 
1090 1095 1100 



tea tec ate acc tgt gtg 
Ser Ser He Thr Cys Val 
1105 1110 

gac caa gtg ctg ccc tec 
Asp Gin Val Leu Pro Ser 
1120 1125 



att ggg get gat ggg 
He Gly Ala Asp Gly 
1115 

tgc aat get ccc tgt 
Cys Asn Ala Pro Cys 
1130 



acg gga tea gaa ggg gta gtt tta tea cca aac 
Thr Gly Ser Glu Gly Val Val Leu Ser Pro Asn 
1135 1140 1145 



aaa ccc tec tgg 
Lys Pro Ser Trp 



gga ggc cag tac 
Gly Gly Gin Tyr 



tac ccc cat aat 
Tyr Pro His Asn 



tac aca get ggt caa ata 
Tyr Thr Ala Gly Gin He 
1150 1155 



tgc etc tat tec ate acg gta cca aag 
Cys Leu Tyr Ser He Thr Val Pro Lys 
1160 



gaa ttc gtg gtc ttt gga cag ttt gee tat ttc 
Glu Phe Val Val Phe Gly Gin Phe Ala Tyr Phe 
1165 1170 1175 



aat gat ttg gca gaa tta 
Asn Asp Leu Ala Glu Leu 
1180 1185 



ttt gat gga acc cat 
Phe Asp Gly Thr His 
1190 



cag aca gec ctg 
Gin Thr Ala Leu 



gca cag gec aga 
Ala Gin Ala Arg 



3600 



3645 



3690 



3735 



3780 



3825 



3870 



3915 



ctt etc age tea etc teg ggg tct cac tea ggg gaa aca ttg ccc 
Leu Leu Ser Ser Leu Ser Gly Ser His Ser Gly Glu Thr Leu Pro 
1195 1200 1205 



3960 
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ttg get acg tea aat caa att ctg etc cga ttc agt gca aag age 4005 
Leu Ala Thr Ser Asn Gin lie lieu Leu Arg Phe Ser Ala Lys Ser 
1210 1215 1220 

ggt gee tct gee cgc ggc ttc cac ttc gtg tat caa get gtt cct 4050 
Gly Ala Ser Ala Arg Gly Phe His Phe Val Tyr Gin Ala Val Pro 
1225 1230 1235 

cgt ace agt gac ace caa tgc age tct gtc ccc gag ccc aga tac 4095 
Arg Thr Ser Asp Thr Gin Cys Ser Ser Val Pro Glu Pro Arg Tyr 
1240 1245 1250 

gga agg aga att ggt tct gag ttt tct gee ggc tec ate gtc cga .4140 
Gly Arg Arg lie Gly Ser Glu Phe Ser Ala Gly Ser lie Val Arg 
1255 1260 1265 

ttc gag trc aac ccg gga tac ctg ctt cag ggt tec acg gcg etc 4185 
Phe Glu Xaa Asn Pro Gly Tyr Leu Leu Gin Gly Ser Thr Ala Leu 
1270 1275 1280 

cac tgc cag tec gtg ccc aac gec ttg gca cag tgg aac gac acg 4230 
His Cys Gin Ser Val Pro Asn Ala Leu Ala Gin Trp Asn Asp Thr 
1285 1290 1295 

ate ccc age tgt gtg gta ccc tgc agt ggc aat ttc act caa cga 4275 
lie Pro Ser Cys Val Val Pro Cys Ser Gly Asn Phe Thr Gin Arg 
1300 1305 1310 

aga ggt aca ate ctg tec ccc ggc tac cct gag cca tac gga aac 4320 
Arg Gly Thr lie Leu Ser Pro Gly Tyr Pro Glu Pro Tyr Gly Asn 
1315 1320 . 1325 

aac ttg aac tgt ata tgg aag ate ata gtt acg gag ggc teg gga 4365 
Asn Leu Asn Cys lie Trp Lys lie lie Val Thr Glu Gly Ser Gly 
1330 1335 1340 

att cag ate caa gtg ate agt ttt gee acg gag cag aac tgg gac 4410 
He Gin lie Gin Val He Ser Phe Ala Thr Glu Gin Asn Trp Asp 
1345 1350 1355 

tec ctt gag ate cac gat ggt ggg gat gtg ace gca ccc aga ctg 4455 
Ser Leu Glu He His Asp Gly Gly Asp Val Thr Ala Pro Arg Leu 
1360 1365 1370 

gga age ttc tea ggc ace aca gta ccg gca ctg ctg aac agt act 4500 
Gly Ser Phe Ser Gly Thr Thr Val Pro Ala Leu Leu Asn Ser Thr 
1375 1380 1385 

tec aac caa etc tac ctg cat ttc cag tct gac att agt gtg gca 4545 
Ser Asn Gin Leu Tyr Leu His Phe Gin Ser Asp lie Ser Val Ala 
1390 1395 1400 

get get ggt ttc cac ctg gaa tac aaa act gta ggt ctt get gca 4590 
Ala Ala Gly Phe His Leu Glu Tyr Lys Thr Val Gly Leu Ala Ala 
1405 1410 1415 

tgc caa gaa cca gee etc ccc age aac age ate aaa ate gga gat 4635 
Cys Gin Glu Pro Ala Leu Pro Ser Asn Ser lie Lys He Gly Asp 
1420 1425 1430 

egg tac atg gtg aac gac gtg etc tec ttc cag tgc gag ccc ggg 4680 
Arg Tyr Met Val Asn Asp Val Leu Ser Phe Gin Cys Glu Pro Gly 
1435 1440 1445 
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tac acc ctg cag ggc cgt 
Tyr Thr Leu Gin Gly Arg 
1450 1455 



tec cac att tec tgt atg cca ggg acc 
Ser His lie Ser Cys Met Pro Gly Thr 
1460 



gtt cgc cgt tgg aac tat ccg tct ccc ctg tgc 
Val Arg Arg Trp Asn Tyr Pro Ser Pro Leu Cys 
1465 1470 1475 



att gca acc tgt 
lie Ala Thr Cys 



99 a 999 ac 9 ct 9 a 9 c acc 
Gly Gly Thr Leu Ser Thr 
1480 * 1485 



ttg ggt ggt gtg ate ctg age ccc ggc 
Leu Gly Gly Val lie Leu Ser Pro Gly 
1490 



4725 



4770 



4815 



ttc cca ggt tct tac ccc aac aac tta gac tgc acc tgg agg ate 
Phe Pro Gly Ser Tyr Pro Asn Asn Leu Asp Cys Thr Trp Arg lie 
1495 ~ 1500 1505 



tea tta ccc ate ggc tat 
Ser Leu Pro lie Gly Tyr 
1510 1515 



ggt gca cat att cag ttt ctg aat ttt 
Gly Ala His lie Gin Phe Leu Asn Phe 
1520 



tct acc gaa get aat cat gac ttc ctt gaa att 
Ser Thr Glu Ala Asn His Asp Phe Leu Glu lie 
1525 1530 1535 



tac cac acc age ccc atg 
Tyr His Thr Ser Pro Met 
1540 1545 



att gga caa ttt age 
lie Gly Gin Phe Ser 
1550 



caa aat gga cct 
Gin Asn Gly Pro 



ggc acg gat etc 
Gly Thr Asp Leu 



4860 



4905 



4950 



4995 



ccc gcg gee ctg ctg age aca acg cat gaa acc 
Pro Ala Ala Leu Leu Ser Thr Thr His Glu Thr 
1555 1560 1565 



etc ate cac ttt 
Leu lie His Phe 



5040 



tat agt gac cat teg caa 
Tyr Ser Asp His Ser Gin 
1570 1575 



aac egg caa gga ttt aaa ctt get tac 
Asn Arg Gin Gly Phe Lys Leu Ala Tyr 
1580 



5085 



caa gee tat gaa tta cag aac tgt cca gat cca 
Gin Ala Tyr Glu Leu Gin Asn Cys Pro Asp Pro 
1585 1590 1595 



aat ggg tac atg ate aac 
Asn Gly Tyr Met lie Asn 
1600 1605 



teg gat tac age gtg 
Ser Asp Tyr Ser Val 
1610 



ccc cca ttt cag 
Pro Pro Phe Gin 



ggg caa tea gta 
Gly Gin Ser Val 



tct ttc gag tgt tat cct ggg tac att eta ata ggc cat cct gtc 
Ser Phe Glu Cys Tyr Pro Gly Tyr lie Leu lie Gly His Pro Val 
1615 1620 1625 



etc act tgt cag cat ggg 
Leu Thr Cys Gin His Gly 
1630 1635 



ate aac aga aac tgg aac tac cct ttt 
lie Asn Arg Asn Trp Asn Tyr Pro Phe 
1640 



5130 



5175 



5220 



5265 



cca aga tgt gat gee cct tgt ggg tac aac gta 
Pro Arg Cys Asp Ala Pro Cys Gly Tyr Asn Val 
1645 ~ 1650 1655 



act tct cag aac 
Thr Ser Gin Asn 



ggc acc ate tac tec cct 
Gly Thr lie Tyr Ser Pro 
1660 1665 



ggc ttt cct gat gag tat ccg ate ctg 
Gly Phe Pro Asp Glu Tyr Pro lie Leu 
1670 



aag gac tgc att tgg etc ate acg gtg cct cca 
Lys Asp Cys lie Trp Leu lie Thr Val Pro Pro 
1675 1680 1685 



ggg cac gga gtt 
Gly His Gly Val 



5310 



5355 



5400 
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tac ate aac ttc acc ctg tta cag acg gaa get gtc aac gat tac 5445 
Tyr ijLe Asn Phe Thr Leu Leu Gin Thr Glu Ala Val Asn Asp Tyr 
1690 1695 1700 

att get gtt tgg gac ggt ccc gat cag aac tea ccc cag ctg gga 5490 
lie Ala Val Trp Asp Gly Pro Asp Gin Asn Ser Pro Gin Leu Gly 
1705 1710 1715 

gtt ttc agt ggc aac aca gec etc gaa acg gcg tat age tec acc 5535 
Val Phe Ser Gly Asn Thr Ala Leu Glu Thr Ala Tyr Ser Ser Thr 
1720 1725 1730 

aac caa gtc ctg etc aag ttc cac age gac ttt tea aat gga ggc 5580 
Asn Gin Val Leu Leu Lys Phe His Ser Asp Phe Ser Asn Gly- Gly 
1735 . 1740 1745 

ttc ttt gtc etc aat ttc cac gca ttt qag etc aag. aaa tgt caa 5625 
Phe Phe Val Leu Asn Phe His Ala Phe Gin Leu Lys Lys Cys Gin 
1750 1755 1760 

eet ccc cca gcg gtt cca cag gca gaa atg ctt act gag gat gat 5670 
Pro Pro Pro Ala Val Pro Gin Ala Glu Met Leu Thr Glu Asp Asp 
1765 1770 1775 

gat ttc gag ata gga gat ttt gtg aag tac cag tgc cac ccc ggg 5715 
Asp Phe Glu lie Gly Asp Phe Val Lys Tyr Gin Cys His Pro Gly 
1780 1785 1790 

tac acc ttg gtg ggg acc gac att ctg act tgc aag etc agt tec 5760 
Tyr Thr Leu Val Gly Thr Asp lie Leu Thr Cys Lys Leu Ser Ser 
1795 1800 1805 

cag ttg cag ttt gag ggt tct etc cca aca tgt gaa gca caa tgc 5805 
Gin Leu Gin Phe Glu Gly Ser Leu Pro Thr Cys Glu Ala Gin Cys 
1810 1815 1820 

cca gca aat gaa gtc egg act gga tea teg gga gtc att etc agt 5850 
Pro Ala Asn Glu Val Arg Thr Gly Ser Ser Gly Val lie Leu Ser 
1825 1830 1835 

cca ggg tat ccg ggt aat tat ttt aac tec cag act tgc tct tgg 5895 
Pro Gly Tyr Pro Gly Asn Tyr Phe Asn Ser Gin Thr Cys Ser Trp 
1840 ^ 1845 1850 

agt att aaa gtg gaa cca aac tac aac att acc ate ttt gtg gac 5940 
Ser lie Lys Val Glu Pro Asn Tyr Asn lie Thr lie Phe Val Asp 
1855 1860 1865 

aca ttt caa agt gaa aag cag ttt gat gca ctg gaa gtg ttt gat 5985 
Thr Phe Gin Ser Glu Lys Gin Phe Asp Ala Leu Glu Val Phe Asp 
1870 1875 1880 

ggt tct tct ggg caa agt eet ctg eta gta gtc tta agt ggg aat 6030 
Gly Ser Ser Gly Gin Ser Pro Leu Leu Val Val Leu Ser . Gly Asn 
1885 " 1890 1895 

cat act gaa caa tea aat ttt aca age agg agt aat cag tta tat 6075 
His Thr Glu Gin Ser Asn Phe Thr Ser Arg Ser Asn Gin Leu Tyr 
1900 1905 1910 

etc cgc tgg tec act gac cat gee acc agt aag aaa gga ttc aag 6120 
Leu Arg Trp Ser Thr Asp His Ala Thr Ser Lys Lys Gly Phe Lys 
1915 1920 1925 
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att cgc tat gca gca cct 
il : e Afg Tyr Ala Ala Pro 
1930 1935 



tac tgc agt ttg acc cac ccc ctg aag 
Tyr Cys Ser Leu Thr His Pro Leu Lys 
1940 



6165 



aat 999 ggt att eta aac agg act gca gga gcg gtt gga age aaa 
Asn Gly Gly lie Leu Asn Arg Thr Ala Gly Ala Val Gly Ser Lys 
1945 1950 1955 



6210 



gtg cat tat ttt tgc aag 
Val His Tyr Phe Cys Lys 
1960 1965 



cct gga tac cga atg gtc ggc cac age 
Pro Gly Tyr Arg Met Val Gly His Ser 
1970 



6255 



aat gca acc tgt aga cga aac cca ctt ggc atg 
Asn Ala Thr Cys Arg Arg Asn Pro Leu Gly Met 
1975 * ~ 1980 1985 



tac cag tgg gac 
Tyr Gin Trp Asp 



6300 



tec etc acg cca etc tgc 
Ser Leu Thr Pro Leu Cys 
1990 1995 



cag get gtg tec tgt gga ate cca gaa 
Gin Ala Val Ser Cys Gly He Pro Glu 
2000 



6345 



tec cca gga aac ggt tea 
Ser Pro Gly Asn Gly Ser 
2005 2010 



ttt acc ggg aac gag ttc act ttg gac 
Phe Thr Gly Asn Glu Phe Thr Leu Asp 
2015 



agt aaa gtg gtc tat gaa tgt cat gag ggc ttc aag ctt gaa tec 

Ser Lys Val Val Tyr Glu Cys His Glu Gly Phe Lys Leu Glu Ser 
2020 2025 2030 

age cag caa gca aca gee gtg tgt caa gaa gat ggg ctg tgg agt 

Ser Gin Gin Ala Thr Ala Val Cys Gin Glu Asp Gly Leu Trp Ser 
2035 2040 2045 



6390 



6435 



6480 



aac aag ggg aag ccg ccc 
Asn Lys Gly Lys Pro Pro 
2050 2055 



acg tgt aag ccg gtc get tgc ccc age 
Thr Cys Lys Pro Val Ala Cys Pro Ser 
2060 



6525 



att gaa get cag etc tea gaa cat gtc ate tgg agg ctg gtt tea 
He Glu Ala Gin Leu Ser Glu His Val He Trp Arg Leu Val Ser 
2065 2070 2075 



6570 



gga tec ttg aat gag tac ggt get caa gta ttg ctg age tgc agt 
Gly Ser Leu Asn Glu Tyr Gly Ala Gin Val Leu Leu Ser Cys Ser 
2080 2085 2090 



6615. 



cct ggt tac tac tta gaa ggc tgg agg etc ctg egg tgc cag gee 
Pro Gly Tyr Tyr Leu Glu Gly Trp Arg Leu Leu Arg Cys Gin Ala 
2095 2100 2105 



6660 



aat ggg acg tgg aac ata 
Asn Gly Thr Trp Asn He 
2110 2115 



gga gat gag agg cca age tgt cga gtt 
Gly Asp Glu Arg Pro Ser Cys Arg Val 
2120 



6705 



ate teg tgt gga age ctt 
He Ser Cys Gly Ser Leu 
2125 2130 



tec ttt ccc cca aat ggc aac aag att 
Ser Phe Pro Pro Asn Gly Asn Lys He 
2135 



6750 



gga acg ttg aca gtt tat 
Gly Thr Leu Thr Val Tyr 
2140 2145 



ggg gee aca get ata 
Gly Ala Thr Ala He 
2150 



ttt acg tgc aac 
Phe Thr Cys Asn 



6795 



acc ggc tac acg ctt gtg 
Thr Gly Tyr Thr Leu Val 
2155 2160 



ggg tct cat gtc aga 
Gly Ser His Val Arg 
2165 



gag tgc ttg gca 
Glu Cys Leu Ala 



6840 
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aat ggg etc tgg age ggc age gaa act cga tgt ctg get ggc cac 6885 
Asn Gly Leu Trp Ser Gly Ser Glu Thr Arg Cys Leu Ala Gly His 
2170 " 2175 2180 

tgc ggt tec cca gac ccg att gtg aac ggt cac att agt gga gat 6930 
Cys Gly Ser Pro Asp Pro He Val Asn Gly His He Ser Gly Asp 
2*85 2190 2195 

ggc ttc agt tac aga gac acg gtg gtt tac cag tgc aat cct ggt 6975 
Gly Phe Ser Tyr Arg Asp Thr Val Val Tyr Gin Cys Asn Pro Gly 
2200 2205 2210 

ttc egg ctt gtg gga act tec gtg agg ata tgc ctg caa gac cac 7020 
Phe Arg Leu Val Gly Thr Ser Val Arg He Cys Leu Gin Asp His 
2215 2220 2225 

aag tgg tct gga caa acg cct gtc tgt gtc ccc ate aca tgt ggt 7065 
Lys Trp Ser Gly Gin Thr Pro Val Cys Val Pro He Thr Cys Gly 
2230 " 2235 2240 

cac cct gga aac cct gec cac gga ttc act aat ggc agt gag ttc 7110 
His Pro Gly Asn Pro Ala His Gly Phe Thr Asn Gly Ser Glu Phe 
2245 2250 2255 

aac ctg aat gat gtc gtg aat ttc acc tgc aac acg ggc tat ttg 7155 
Asn Leu Asn Asp Val Val Asn Phe Thr Cys Asn Thr Gly Tyr Leu 
2260 2265 2270 

ctg cag ggc gtg tct cga gee cag tgt egg age aac ggc cag tgg 7200 
Leu Gin Gly Val Ser Arg Ala Gin Cys Arg Ser Asn Gly Gin Trp 
2275 2280 2285 

agt age cct ctg ccc acg tgt cga gtg gtg aac tgt tct gat cca 7245 
Ser Ser Pro Leu Pro Thr Cys Arg Val Val Asn Cys Ser Asp Pro 
2290 2295 2300 

ggc ttt gtg gaa aat gec att cgt cac ggg caa cag aac ttc cct 7290 
Gly Phe Val Glu Asn Ala He Arg His Gly Gin Gin Asn Phe Pro 
2305 2310 2315 

gag agt ttt gag tat gga atg agt ate ctg tac cat tgc aag aag 7335 
Glu Ser Phe Glu Tyr Gly Met Ser He Leu Tyr His Cys Lys Lys 
2320 2325 2330 

gga ttt tac ttg ctg gga tct tea gec ttg acc tgt atg gca aat 7380 
Gly phe Tyr Leu Leu Gly Ser Ser Ala Leu Thr Cys Met Ala Asn 
2335 2340 2345 

ggc tta tgg gac cga tec ctg ccc aag tgt ttg get ata teg tgt 7425 
Gly Leu Trp Asp Arg Ser Leu Pro Lys Cys Leu Ala He Ser Cys 
2350 2355 2360 

gga cac cca ggg gtc cct gee aac gec gtc etc act gga gag ctg 7470 
Gly His Pro Gly Val Pro Ala Asn Ala Val Leu Thr Gly Glu Leu 
2365 2370 2375 

ttt acc tat ggc gec gtc gtg cac tac tec tgc aga ggg age gag 7515 
Phe Thr Tyr Gly Ala Val Val His Tyr Ser Cys Arg Gly Ser Glu 
2380 2385 2390 

age etc ata ggc aac gac acg aga gtg tgc cag gaa gac agt cac 7560 
Ser Leu He Gly Asn Asp Thr Arg Val Cys Gin Glu Asp Ser His 
2395 2400 2405 
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tgg age ggg gca ctg ccc cac tgc aca gga aat aat cct gga ttc 7605 
.^rp Ser Gly Ala Leu Pro His Cys Thr Gly Asn Asn Pro Gly Phe 
2*410 ' 2415 2420 

tgt ggt gat ccg ggg acc cca gca cat ggg tct, egg ctt ggt gat 7650 
Cys Gly Asp Pro Gly Thr Pro Ala His Gly Ser Arg Leu Gly Asp 
2425 2430 2435 

.gac ttt aag aca aag agt ctt etc cgc ttc tec tgt gaa atg ggg 7695 
Asp Phe Lys Thr Lys Ser Leu Leu Arg Phe Ser Cys Glu Met Gly 
2440 2445 2450 

cac cag ctg agg ggc tec cct gaa cgc acg tgt ttg etc aat ggg 7740 
His Gin Leu Arg Gly Ser Pro Glu Arg Thr Cys Leu Leu Asn Gly 
2455 2460 2465 

tea tgg tea gga ctg cag ccg gtg tgt gag gee gtg tec tgt ggc 7785 
Ser Trp Ser Gly Leu Gin Pro Val Cys Glu Ala Val Ser Cys Gly 
2470 2475 2480 

aac cct ggc aca ccc acc aac gga atg att gtc agt agt gat ggc 7830 
Asn Pro Gly Thr Pro Thr Asn Gly Met lie Val Ser Ser Asp Gly 
2485 2490 2495 

att ctg ttc tec age teg gtc ate tat gee tgc tgg gaa ggc tac 7875 
lie Leu Phe Ser Ser Ser Val lie Tyr Ala Cys Trp Glu Gly Tyr 
2500 2505 2510 



aag acc tea ggg etc atg aca egg cat tgc aca gee aat ggg acc 
Lys Thr Ser Gly Leu Met Thr Arg His Cys Thr Ala Asn Gly Thr 
2515 2520 2525 



tgg ggc tec age ata agt tac age tgc atg gac ggt tac cag etc 
Trp Gly Ser Ser He Ser Tyr Ser Cys Met Asp Gly Tyr Gin Leu 
2620 2625 2630 



7920 



tgg aca ggc act get ccc gac tgc aca att ata agt tgt ggg gat 7965 
Trp Thr Gly Thr Ala Pro Asp Cys Thr He He Ser Cys Gly Asp 
2530 2535 2540. 

cca ggc aca eta gca aat ggc ate cag ttt ggg acc gac ttc acc 8 010 

Pro Gly Thr Leu Ala Asn Gly He Gin Phe Gly Thr Asp Phe Thr 
2545 2550 2555 

ttc aac aag act gtg age tat cag tgt aac cca ggc tat gtc atg 8055 
Phe Asn Lys Thr Val Ser Tyr Gin Cys Asn Pro Gly Tyr Val Met 
2560 2565 2570 

gaa gca gtc aca tec gee act att cgc tgt acc aaa gac ggc agg 8100 
Glu Ala Val Thr Ser Ala Thr He Arg Cys Thr Lys Asp Gly Arq 
2575 2580 2585 

tgg aat ccg age aaa cct gtc tgc aaa gee gtg ctg tgt cct cag 8145 
Trp Asn Pro Ser Lys Pro Val Cys Lys Ala Val Leu Cys Pro Gin 
2590 2595 2600 

ccg ccg ccg gtg cag aat gga aca gtg gag gga agt gat ttc cgc 8190 
Pro Pro Pro Val Gin Asn Gly Thr Val Glu Gly Ser Asp Phe Arg 
2605 2610 2615 



8235 



tct cac tec gec ate etc tec tgt gaa ggt cgc ggg gtg tgg aaa 8280 
Ser His Ser Ala He Leu Ser Cys Glu Gly Arg Gly Val Trp Lys 
2635 2640 ' 2645 
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gga gag ate ccc cag tgt etc cct gtg ttc tgc 
Gly Glu lie Pro Gin Cys Leu Pro Val Phe Cys 
2650 2655 2660 



gga gac cct ggc 
Gly Asp Pro Gly 



8325 



ate ccc gca gaa ggg cga ctt agt ggg aaa agt 
lie Pro Ala Glu Gly Arg Leu Ser Gly Lys Ser 
2665 2670 2675 



ttc acc tat aag 
Phe Thr Tyr Lys 



8370 



tec gaa gtc ttc ttc cag tgc aaa tct cca ttt ata etc gtg gga 
Ser Glu Val Phe Phe Gin Cys Lys Ser Pro Phe lie Leu Val Gly 
2680 2685 2690 



8415 



tec tec aga aga gtc tgc caa get gac ggc acg 
Ser Ser Arg Arg Val Cys Gin Ala Asp Gly Thr 
2695 2700 2705 



tgg. age ggc ata 
Trp Ser Gly lie 



8460 



caa ccc acc tgc att gat cct get cat aac acc tgc cca gac cct 
Gin Pro Thr Cys He Asp Pro Ala His Asn Thr Cys Pro Asp Pro 
2710 " 2715 2720 



8505 



ggt acg cca cac ttt gga ata cag aat age tec 
Gly Thr Pro His Phe Gly He Gin Asn Ser Ser 
2725 2730 2735 



aga ggc tat gag 
Arg Gly Tyr Glu 



8550 



gtt gga age acg gtt ttt 
Val Gly Ser Thr Val Phe 
2740 2745 



ttc agg tgc aga aaa 
Phe Arg Cys Arg Lys 
2750 



ggc tac cat att 
Gly Tyr His He 



8595 



caa ggt tec acg act cgc acc tgc ctt gee aat tta aca tgg agt 
Gin Gly Ser Thr Thr Arg Thr Cys Leu Ala Asn Leu Thr Trp Ser 
2755 2760 2765 



8640 



ggg ata cag acc gaa tgt ata cct cat gee tgc aga cag cca gaa 
Gly He Gin Thr Glu Cys He Pro His Ala Cys Arg Gin Pro Glu 
2770 2775 2780 



8685 



acc ccg gca cac gcg gat gtg aga gee ate gat ctt cct act ttc 
Thr Pro Ala His Ala Asp Val Arg Ala He Asp Leu Pro Thr Phe 
2785 2790 " 2795 



8730 



ggc tac acc tta gtg tac acc tgc cat cca ggc 
Gly Tyr Thr Leu Val Tyr Thr Cys His Pro Gly 
2800 2805 2810 



ttt ttc etc gca 
Phe Phe Leu Ala 



8775 



ggg gga tct gag cac aga aca tgt aaa gca gac atg aaa tgg aca 
Gly Gly Ser Glu His Arg Thr Cys Lys Ala Asp Met Lys Trp Thr 
2815 2820 2825 



8820 



gga aag teg cct gtg tgt aaa agt aaa gga gtg aga gaa gtt aat 
Gly Lys Ser Pro Val Cys Lys Ser Lys Gly Val Arg Glu Val Asn 
2830 2835 " 2840 



8865 



gaa aca gtt act aaa act cca gtt cct tea gat 
Glu Thr Val Thr Lys Thr Pro Val Pro Ser Asp 
2845 2850 2855 



gtc ttt ttc gtc 
Val Phe Phe Val 



8910 



aat tea ctg tgg aag ggg tat tat gaa tat tta ggg aaa aga caa 
Asn Ser Leu Trp Lys Gly Tyr Tyr Glu Tyr Leu Gly Lys Arg Gin 
2860 2865 2870 



8955 



ccc gee act eta act gtt gac tgg ttc aat gca aca age agt aag 
Pro Ala Thr Leu Thr Val Asp Trp Phe Asn Ala Thr Ser Ser Lys 
2875 2880 2885 



9000 
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gtg aat gcc acc ttc age gaa gec teg cca gtg gag ctg aag ttg 9045 
Val Asn Ala Thr Phe Ser Glu Ala Ser Pro Val Glu Leu Lys Leu 
2890 2895 2900 

aca ggc att tac aag aag gag gag gcc cac tta etc ctg aaa get 9090 
Thr Gly lie Tyr Lys Lys Glu Glu Ala His Leu Leu Leu Lys Ala 
2905 2910 2915 

ttt caa att aaa ggc cag gca gat att ttt gta age aag ttc gaa 9135 
Phe Gin lie Lys Gly Gin Ala Asp lie Phe Val Ser Lys Phe Glu 
2920 2925 * 2930 

aat gac aac tgg gga eta gat ggt tat gtg tea tct gga ctt gaa 9180 
Asn Asp Asn Trp Gly Leu Asp Gly Tyr Val Ser Ser Gly Leu Glu 
2935 2940 2945 

a 9 a 99 a gga ttt act ttt caa ggt gac att cat gga aaa gac ttt 9225 
Arg Gly Gly Phe Thr Phe Gin Gly Asp lie His Gly Lys Asp Phe 
2950 '. 2955 2960 

gga aaa ttt aag eta gaa agg caa gat cct tta aac cca gat caa 9270 
Gly Lys Phe Lys Leu Glu Arg Gin Asp Pro Leu Asn Pro Asp Gin 
2965 2970 2975 

gac tct tec agt cat tac cac ggc acc age agt ggc tct gtg gcg 9315 
Asp Ser Ser Ser His Tyr His Gly Thr Ser Ser Gly Ser Val Ala 
2980 2985 2990 

get gcc att ctg gtt cct ttc ttt get eta att tta tea ggg ttt 9360 
Ala Ala lie Leu Val Pro Phe Phe Ala Leu lie Leu Ser Gly Phe 
2995 3000 3005 

gca ttt tac etc tac aaa cac aga acg aga cca aaa gtt caa tac 94 05 

Ala Phe Tyr Leu Tyr Lys His Arg Thr Arg Pro Lys Val Gin Tyr 
3010 3015 3020 

aat ggc tat get ggg cat gaa aac age aat gga caa gca teg ttt 9450 
Asn Gly Tyr Ala Gly His Glu Asn Ser Asn Gly Gin Ala Ser Phe 
3025 3030 3035 

gaa aac ccc atg tat gat aca aac tta aaa ccc aca gaa gcc aag 9495 
Glu Asn Pro Met Tyr Asp Thr Asn Leu Lys Pro Thr Glu Ala Lys 
3040 3045 3050 

get gtg agg ttt gac aca act ctg aac aca gtc tgt aca gtg gta 9540 
Ala Val Arg Phe Asp Thr Thr Leu Asn Thr Val Cys Thr Val Val 
3055 3060 3065 

tagccctcag tgccccaaca ggactgattc atagecatae ctctgatgga caagcagtga 9600 
ttcctttggt gccatatacc actctcccyt ccactctggc tttactgcag cgatcttcaa 9660 
ccttgtctac tggcataagt geagegggga tctctactca aatgtgtcag ggtcttctac 9720 
ggatcaaact acacatgegt tttcattcca aaagtgggtt etaaatgect ggctgeatet 9780 
gtatgaaatc aaggcacact ccaggaagac tgccacgtcg cgccaacacg tcatactcaa 984 0 
trcctcagac tttcatattt ctgtgttgct gagatgeett teaatgeaat cgtctgggct 9900 
cgtggatatg tccctcaggt gcggtgacag aatggtggca ccacgatatg tgttctcttg 9960 
tgttgttttt cctttttaaa cccccatgaa cacgaatact ctgaaaaaaa taaaaagctt 10020 
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tctggaagaa 


gacacctttc 


tgatagaggc 


tcacacctac 


aaatgcttca 


ctctgtcctt 


10080 


ccgagacctg 


acaagctttg 


aggacctcac 


agctcccctg 


tgtgttcatc 


tctagggatg 


10140 


tt.tgcaattt 


cccagtcagc 


tgttctgtcg 


cagaatgttt 


aatgcacaat 


tttttgcact 


10200 


agtgtgttat 


gaatgactaa 


gattctgata 


aaaaaaataa 


attatttaca 


cagggtttat 


10260 


acacactatc 


cattgtatat 


aagcattatt 


tcatattatc 


aagctaaaca 


ttcccccatc 


10320 


agcttagttg 


gagtgttagg 


gaaaagtatt 


cctagatatg 


gcacagattt 


taaaaggaaa 


10380 


tacagtattg 


acgagattta 


ttttattatt 


gcttcaatta 


gctccattta 


cgtgttgaat 


10440 


tcattgaaga 


ggtccaatga 


gaaaaaaaca 


gaagcctcct 


tatttcacac 


gttttcctcc 


10500 


tttagtacca 


tcctcatcca 


attactgtct 


ctctgatact 


acttaatagc 


agggggtttg 


10560 


cagaaatttc 


tgtttgccat 


gtaaaactgt 


gaatagtaat 


ttattttaga 


tagtcgatga 


10620. 


aqttgtgggt 


tttagctcac 


aatgcagcct 


tcccttttgc 


agtgtttttt 


ttt 


10673 



<210> 2 

<211> 3069 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Ser Asn Gin Met Trp Leu His Leu Gin Ser Asp Asp Ser He Gly 
1 5 10 15 

Ser Pro Gly Phe Lys Ala Val Tyr Gin Glu He Glu Lys Gly Gly Cys 
20 25 30 

Gly Asp Pro Gly He Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe 
35 40 45 

Leu His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu 
50 55 60 

Val Gly Glu Arg Val He Thr Cys Gin Gin Asn Asn Gin Trp Ser Gly 
65 70 75 80 

Asn Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser 
85 90 95 - 



Ser Gly He 



He Leu Ser Pro Asn Tyr Pro Glu 
100 105 



Glu Tyr Gly Asn Asn 
110 
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Met Asn Cys Val Trp Leu He He Ser Glu Pro Gly Ser Arg He His 
, 115 120 125 



Leu He Phe Asn Asp Phe Asp Val Glu Pro Gin Phe Asp Phe Leu Ala 
130 135 140 

Val Lys Asp Asp Gly He Ser Asp He Thr Val Leu Gly Thr Phe Ser 
145 * ~ 150 155 160 

Gly Asn Glu Val Pro Ser Gin Leu Ala Ser Ser Gly His He Val Arg 
165 170 175 

«. 

Leu Glu Phe Gin Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn He 
180 185 190 

Thr Tyr Thr Thr Phe Gly Gin Asn Glu Cys His Asp Pro Gly He Pro 
195 200 205 

He Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val 
210 215 220 

Ser Phe His Cys Asp Asp Gly Phe Val Lys Thr Gin Gly Ser Glu Ser 
225 230 235 240 

He Thr Cys He Leu Gin Asp Gly Asn Val Val Trp Ser Ser Thr Val 
245 250 255 

Pro Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly 
260 265 270 

Val He Leu Pro Pro Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His 
275 280 285 

Cys Glu Trp He He Glu Ala Lys Pro Gly His Ser He Lys He Thr 
290 295 300 

Phe Asp Arg Phe Gin Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg 
305 " 310 315 320 

Asp Gly Pro Ala Ser Ser Ser Pro Leu He Gly Glu Tyr His Gly Thr 
325 330 335 

Gin Ala Pro Gin Phe Leu He Ser Thr Gly Asn Phe Met Tyr Levi Leu 
340 345 350 



Phe Thr Thr Asp Asn Ser Arg Ser Ser He Gly Phe Leu He His Tyr 
355 360 365 
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Glu Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly He Pro 
370 375 380 



Val Asn Xaa His Arg His Gly Gly Asp Phe Gly He Arg Ser Thr Val 
385 390 395 400 



Thr Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu 
405 410 415 



Vai Cys Glu Arg Asn His uin xrp Asn His Ala Leu fro Ser uys Asp 
420 425 430 



Ala Leu Cys Gly Gly Tyr He Gin Gly Lys Ser Gly Thr Val Leu Ser 
435 440 445 



Pro Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Xaa Thr Trp Thr 
450 455 460 

lie Glu Val Ser His Gly Lys Gly Val Gin Met He Phe His Thr Phe 
465 470 475 480 



His Leu Glu Ser Ser His Asp Tyr Leu Leu He Thr Glu Asp Gly Ser 
485 490 495 

Phe Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr 
^ 500 505 510 

He Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gin Leu Arg Phe He 
515 520 525 

Ser Asp Phe Ser He Ser Tyr Glu Gly Phe Asn He Thr Phe Ser Glu 
530 535 540 

Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe Ser Arg 
545 & 550 555 560 



Arg He Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr Phe Ser Cys 
565 570 575 

Phe Leu Gly Tyr Arg Leu Glu Gly Ala Xaa Lys Leu Thr Cys Leu Gly 
580 585 590 

Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro Arg Cys Val Ala Glu 
595 600 605 

Cys Gly Ala Ser Val Lys Gly Asn Glu Gly Thr Leu Leu Ser Pro Asn 
610 615 620 
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Phe Pro Ser Asn Tyr Asp Asn Asn His Glu Cys lie Tyr Lys lie Glu 
625 630 635 640 

Thr Glu Ala Gly Lys Gly lie His Leu Arg Thr Arg Ser Phe Gin Leu 
645 650 655 

Phe Glu Gly Asp Thr Leu Lys Val Tyr Asp Gly Lys Asp Ser Ser Ser 
660 665 670 

Arg Pro Leu Gly Thr Phe Thr Lys Asn Glu Leu Leu Gly Leu lie Leu 
675 680 685 

Asn Ser Thr Ser Asn His Xaa Trp Leu Glu Phe Asn Thr Asn Gly Ser 
690 695 700 

Asp Thr Asp Gin Gly Phe Gin Leu Thr Tyr Thr Ser Phe Asp Leu Val 
705 710 715 720 

Lys Cys Glu Asp Pro Gly lie Pro Asn Tyr Gly Tyr Arg lie Arg Asp 
725 . 730 735 

Glu Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser Cys Asn Pro Gly 
740 745 750 

Tyr Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu Ser Gly Asp Arg 
755 760 765 

Arg Val Trp Asp Lys Pro Leu Pro Ser Cys lie Ala Glu Cys Gly Gly 
770 775 780 

Gin lie His Ala Ala Thr Ser Gly Arg He Leu Ser Pro Gly Tyr Pro 
785 790 795 800 

Ala Pro Tyr Asp Asn Asn Leu His Cys Thr Trp He He Glu Ala Asp 
805 810 815 

Pro Gly Lys Thr He Ser Leu His Phe He Val Phe Asp Thr Glu Met 
820 825 830 

Ala His Asp He Leu Lys Val Trp Asp Gly Pro Val Asp Ser Asp lie 
835 840 845 

Leu Leu Lys Glu Trp Ser Gly Ser Ala Leu Pro Glu Asp He His Ser 
850 * 855 860 



Thr Phe Asn Ser Leu Thr Leu Gin Phe Asp Ser Asp Phe Phe He Ser 
865 870 875 880 
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Lys Ser Gly Phe Ser lie Gin Phe Ser Thr Ser He Ala Ala Thr Cys 
J 885 890 895 

Asn Asp Pro Gly Met Pro Gin Asn Gly Thr Arg Tyr Gly Asp Ser Arg 
900 905 910 

*■ * i 

Glu Ala Gly Asp Thr Val Thr Phe Gin Cys Asp Pro Gly Tyr Gin Leu 
915 920 925 

Gin Gly Gin Ala Lys He Thr Cys Val Gin Leu Asn Asn Arg Phe Phe 
930 935 940 

i. 

Trp Gin Pro Asp Pro Pro Thr Cys He Ala Ala Cys Gly Gly Asn Leu 
945 950 955 960 

Thr Gly Pro Ala Gly Val He Leu Ser Pro Asn Tyr Pro Gin Pro Tyr 
965 970 975 

Pro Pro Gly Lys Glu Cys Asp Trp Arg Val Lys Val Asn Pro Asp Phe 
980 985 990 

Val He Ala Leu He Phe Lys Ser Phe Asn Met Glu Pro Ser Tyr Asp 
995 1000 1005 

Phe Leu His He Tyr Glu Gly Glu Asp Ser Asn Ser Pro Leu He 
1010 1015 1020 

Gly Ser Tyr Gin Gly Ser Gin Ala Pro Glu Arg He Glu Ser Ser 
1025 1030 1035 

Gly Asn Ser Leu Phe Leu Ala Phe Arg Ser Asp Ala Ser Val Gly 
1040 1045 1050 

Leu Ser Gly Phe Ala He Glu Phe Lys Glu Lys Pro Arg Glu Ala 
1055 1060 1065 

Cys Phe Asp Pro Gly Asn He Met Asn Gly Thr Arg Val Gly Thr 
1070 1075 1080 

Asp Phe Lys Leu Gly Ser Thr He Thr Tyr Gin Cys Asp Ser Gly 
1085 1090 1095 

Tyr Lys He Leu Asp Pro Ser Ser He Thr Cys Val He Gly Ala 
1100 1105 1110 



Asp Gly Lys Pro Ser Trp Asp Gin Val Leu Pro Ser Cys Asn Ala 
* 115 1120 1125 
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Pro Cys Gly Gly Gin Tyr Thr Gly Ser Glu Gly Val Val Leu Ser 
1130 1135 1140 



Pro Asn Tyr Pro His Asn Tyr Thr Ala Gly Gin lie Cys Leu Tyr 
1145 1150 1155 

Ser lie Thr Val Pro Lys Glu Phe Val Val Phe Gly Gin Phe Ala 
1160 1165 1170 



Tyr Phe Gin Thr Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp Gly 
1175 1180 1185 

Thr His Ala Gin Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser His 
1190 1195 1200 



Ser Gly Glu Thr Leu Pro Leu Ala Thr Ser Asn Gin lie Leu Leu 
1205 1210 1215 

Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala Arg Gly Phe His Phe 
1220 1225 1230 



Val Tyr Gin Ala Val Pro Arg Thr Ser Asp Thr Gin Cys Ser Ser 
1235 1240 1245 



Val Pro Glu Pro Arg Tyr Gly Arg Arg lie Gly Ser Glu Phe Ser 
1250 ~ 1255 1260 



Ala Gly Ser lie Val Arg Phe Glu Xaa Asn Pro Gly Tyr Leu Leu 
1265 1270 1275 



Gin Gly Ser Thr Ala Leu His Cys Gin Ser Val Pro Asn Ala Leu 
1280 1285 1290 



Ala Gin Trp Asn Asp Thr lie Pro Ser Cys Val Val Pro Cys Ser 
1295 1300 1305 



Gly Asn Phe Thr Gin Arg Arg Gly Thr lie Leu Ser Pro Gly Tyr 
1310 1315 1320 



Pro Glu Pro Tyr Gly Asn Asn Leu Asn Cys lie Trp Lys lie lie 
1325 " 1330 1335 



Val Thr Glu Gly Ser Gly He Gin He Gin Val He Ser Phe Ala 
1340 "* 1345 1350 



Thr Glu Gin Asn Trp Asp Ser 
1355 1360 



Leu Glu He His Asp Gly Gly Asp 
1365 
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Val Thr Ala Pro Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro 
1370 1375 1380 



Ala Leu Leu Asn Ser Thr Ser Asn Gin Leu Tyr Leu His Phe Gin 
1385 1390 1395 



Ser Asp He Ser Val Ala Ala Ala Gly Phe His Leu Glu Tyr Lys 

1400 1405 1410 

Thr Val Gly Leu Ala Ala Cys Gin Glu Pro Ala Leu Pro Ser Asn 

1415 1420 1425 

Ser He Lys He Gly Asp Arg Tyr Met Val Asn Asp Val Leu Ser 

1430 1435 1440 

Phe Gin Cys Glu Pro Gly Tyr Thr Leu Gin Gly Arg Ser His He 

1445 1450 1455 



Ser Cys Met Pro Gly Thr Val Arg Arg Trp Asn Tyr Pro Ser Pro 
1460 1465 1470 



Leu Cys He Ala Thr Cys Gly Gly Thr Leu Ser Thr Leu Gly Gly 
1475 1480 1485 

Val He Leu Ser Pro Gly Phe Pro Gly Ser Tyr Pro Asn Asn Leu 
1490 1495 1500 



Asp Cys Thr Trp Arg He Ser Leu Pro He Gly Tyr Gly Ala His 
1505 1510 1515 

He Gin Phe Leu Asn Phe Ser Thr Glu Ala Asn His Asp Phe Leu 
1520 1525 1530 

Glu He Gin Asn Gly Pro Tyr His Thr Ser Pro Met He Gly Gin 
1535 1540 1545 

Phe Ser Gly Thr Asp Leu Pro Ala Ala Leu Leu Ser Thr Thr His 
1550 1555 1560 

Glu Thr Leu He His Phe Tyr Ser Asp His Ser Gin Asn Arg Gin 
1565 1570 1575 

Gly Phe Lys Leu Ala Tyr Gin Ala Tyr Glu Leu Gin Asn Cys Pro 
1580 1585 1590 



Asp Pro Pro Pro Phe Gin Asn Gly Tyr Met He Asn Ser Asp Tyr 
1595 1600 1605 
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Ser Val Gly Gin Ser Val Ser Phe Glu Cys Tyr Pro Gly Tyr lie 
160.0 1615 1620 

Leu lie Gly His Pro Val Leu Thr Cys Gin His Gly lie Asn Arg 
1625 1630 1635 



Asn Trp Asn Tyr Pro Phe Pro Arg Cys Asp Ala Pro Cys Gly Tyr 
1640 1645 1650 



Asn Val Thr Ser Gin Asn Gly Thr lie Tyr Ser Pro Gly Phe Pro 
1655 1660 1665 



Asp Glu Tyr Pro lie Leu Lys Asp Cys lie Trp Leu lie Thr Val 
1670 1675 1680 



Pro Pro Gly His Gly Val Tyr lie Asn Phe Thr Leu Leu Gin Thr 
1685 1690 1695 



Glu Ala Val Asn Asp Tyr lie Ala Val Trp Asp Gly Pro Asp Gin 
1700 1705 1710 



Asn Ser Pro Gin Leu Gly Val Phe Ser Gly Asn Thr Ala Leu Glu 
1715 1720 1725 



Thr Ala Tyr Ser Ser Thr Asn Gin Val Leu Leu Lys Phe His Ser 
1730 1735 1740 



Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn Phe His Ala Phe 
1745 1750 1755 



Gin Leu Lys Lys Cys Gin Pro Pro Pro Ala Val Pro Gin Ala Glu 
1760 1765 1770 



Met Leu Thr Glu Asp Asp Asp Phe Glu lie Gly Asp Phe Val Lys 
1775 1780 1785 



Tyr Gin Cys His Pro Gly Tyr Thr Leu Val Gly Thr Asp lie Leu 
1790 1795 1800 



Thr Cys Lys Leu Ser Ser Gin Leu Gin Phe Glu Gly Ser Leu Pro 
1805 1810 1815 



Thr Cys Glu Ala Gin Cys Pro Ala Asn Glu Val Arg Thr Gly Ser 
1820 1825 1830 



Ser Gly Val lie Leu Ser Pro Gly Tyr Pro Gly Asn Tyr Phe Asn 
1835 1840 ^ "* , 1845 
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Ser Gin Thr Cys Ser Trp Ser He Lys Val Glu Pro Asn Tyr Asn 
1850 1855 1860 



He Thr He Phe Val Asp Thr Phe Gin Ser Glu Lys Gin Phe Asp 
1865 1870 1875 



Ala Leu Glu Val Phe Asp Gly Ser Ser Gly Gin Ser Pro Leu Leu 
1880 1885 1890 



Val Val Leu Ser Gly Asn His Thr Glu Gin Ser Asn Phe Thr Ser 
1895 1900 1905 



Arg Ser Asn Gin Leu Tyr Leu Arg Trp Ser Thr Asp His Ala Thr 
1910 1915 1920 



Ser Lys Lys Gly Phe Lys He Arg Tyr Ala Ala Pro Tyr Cys Ser 
1925 1930 1935 



Leu Thr His Pro Leu Lys Asn Gly Gly He Leu Asn Arg Thr Ala 
1940 1945 1950 



Gly Ala Val Gly Ser Lys Val His Tyr Phe Cys Lys Pro Gly Tyr 
1955 1960 1965 



Arg Met Val Gly His Ser Asn Ala Thr Cys Arg Arg Asn Pro Leu 
1970 1975 1980 



Gly Met Tyr Gin Trp Asp Ser Leu Thr Pro Leu Cys Gin Ala Val 
1985 1990 1995 



Ser Cys Gly He Pro Glu Ser Pro Gly Asn Gly Ser Phe Thr Gly 
2000 2005 2010 



Asn Glu Phe Thr Leu Asp Ser Lys Val Val Tyr Glu Cys His Glu 
2015 2020 2025 



Gly Phe Lys Leu Glu Ser Ser Gin Gin Ala Thr Ala Val Cys Gin 
2030 2035 2040 



Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro Pro Thr Cys Lys 
2045 2050 * 2055 



Pro Val Ala Cys Pro Ser He Glu Ala Gin Leu Ser Glu His Val 
2060 2065 2070 



He Trp Arg Leu Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala Gin 
2075 2080 2085 
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Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu Glu Gly Tip Arg 
2090 2095 2100 



Leu Leu Arg Cys Gin Ala Asn Gly Thr Trp Asn lie Gly Asp Glu 
2105 2110 2115 



Arg Pro Ser Cys Arg Val lie Ser Cys Gly Ser Leu Ser Phe Pro 
2120 " . 2125 2130 



Pro Asn Gly Asn Lys lie Gly Thr Leu Thr Val Tyr Gly Ala Thr 
2135 2140 2145 



Ala lie Phe Thr Cys Asn Thr Gly Tyr Thr Leu Val Gly Ser His 
2150 2155 2160 



Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser Gly Ser Glu Thr 
2165 2170 2175 



Arg Cys Leu Ala Gly His Cys Gly Ser Pro Asp Pro He Val Asn 
2180 2185 2190 



Gly His He Ser Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val Val 
2195 2200 2205 



Tyr Gin Cys Asn Pro Gly Phe Arg Leu Val Gly Thr Ser Val Arg 
2210 ~ 2215 ~ 2220 



He Cys Leu Gin Asp His Lys Trp Ser. Gly Gin Thr Pro Val Cys 
2225 2230 2235 



Val Pro He Thr Cys Gly His Pro Gly Asn Pro Ala His Gly Phe 
2240 2245 2250 



Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val Val Asn Phe Thr 
2255 2260 2265 



Cys Asn Thr Gly Tyr Leu Leu Gin Gly Val Ser Arg Ala Gin Cys 
2270 2275 2280 



Arg Ser Asn Gly Gin Trp Ser Ser Pro Leu Pro Thr Cys Arg Val 
2285 2290 2295 



Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn Ala He Arg His 
2300 2305 2310 



Gly Gin Gin Asn Phe Pro Glu Ser Phe Glu Tyr Gly Met Ser He 
2315 2320 2325 
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Leu Tyx His Cys Lys Lys Gly Phe Tyr Leu Leu Gly Ser Ser Ala 
2330 2335 2340 



Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg Ser Leu Pro Lys 
2345 2350 2355 



Cys Leu Ala He Ser Cys Gly His Pro Gly Val Pro Ala Asn Ala 
~' 2360 2365 2370 



Val Leu Thr Gly Glu Leu Phe Thr Tyr Gly Ala Val Val His Tyr 
2375 2380 2385 



Ser Cys Arg Gly Ser Glu Ser Leu He Gly Asn Asp Thr Arg Val 
2390 2395 2400 



Cys Gin Glu Asp Ser His Trp Ser Gly Ala Leu Pro His Cys Thr 
2405 2410 2415 



Gly Asn Asn Pro Gly Phe Cys Gly Asp Pro Gly Thr Pro Ala His 
24.20 2425 ' 2430 



Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys Ser Leu Leu Arg 
2435 ~ 2440 2445 



Phe Ser Cys Glu Met Gly His Gin Leu Arg Gly Ser Pro Glu Arg 
2450 2455 2460 



Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu Gin Pro Val Cys 
2465 2470 2475 



Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly Met 
2480 2485 2490 



He Val Ser Ser Asp Gly He Leu Phe Ser Ser Ser Val He Tyr 
2495 2500 2505 



Aia Cys Trp Glu Gly Tyr Lys Thr Ser Gly Leu Met Thr Arg His 
2510 2515 2520 



Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala Pro Asp Cys Thr 
2525 2530 2535 



He He Ser Cys Gly Asp Pro Gly Thr Leu Ala Asn Gly He Gin 
2540 2545 2550 



Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val Ser Tyr Gin Cys 
2555 2560 2565 
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Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser Ala Thr lie Arg 
:-jz: 257;0 2575 2580 



Cys Thr Lys Asp Gly Arg Trp Asn Pro Ser Lys Pro Val Cys Lys 
2585 2590 2595 



Ala Val Leu Cys Pro Gin Pro Pro Pro Val Gin Asn Gly Thr Val 
~ 2600 2605 2610 



Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser lie Ser Tyr Ser Cys 
2615 2620 2625 



Met Asp Gly Tyr Gin Leu Ser His Ser Ala lie Leu Ser Cys Glu 
2630 2635 2640 



Gly Arg Gly Val Trp Lys Gly Glu lie Pro Gin Cys Leu Pro Val 
2645 2650 2655 



Phe Cys Gly Asp Pro Gly lie Pro Ala Glu Gly Arg Leu Ser Gly 
2660 2665 2670 



Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe Gin Cys Lys Ser 
2675 2680 2685 



Pro Phe lie Leu Val Gly Ser Ser Arg Arg Val Cys Gin Ala Asp- 
2690 2695 2700 



Gly Thr Trp Ser Gly lie Gin Pro Thr Cys He Asp Pro Ala His 
2705 2710 2715 



Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe Gly He Gin Asn 
2720 2725 2730 



Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val Phe Phe Arg Cys 
2735 2740 2745 



Arg Lys Gly Tyr His Tie Gin Gly Ser Thr Thr Arg Thr Cys Leu 
2750 " 2755 " 2760 



Ala Asn Leu Thr Trp Ser Gly He Gin Thr Glu Cys He Pro His 
2765 2770 2775 



Ala Cys Arg Gin Pro Glu Thr Pro Ala His Ala Asp Val Arg Ala 
2780 2785 2790 



He Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val Tyr Thr Cys His 
2795 2800 2805 
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Pro Gly Phe Phe Leu Ala Gly Gly Ser Glu His Arg Thr Cys Lys 
2810 2815 2820 



Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val Cys Lys Ser Lys 
2825 2830 2835 



Gly Val Arg Glu Val Asn Glu Thr Val Thr Lys Thr Pro Val Pro 
2840 2845 2850 



ser Asp Vai Phe Phe val Asn Ser Leu Trp Lys Gly Tyr Tyr Glu 
2855 2860 2865 



Tyr Leu Gly Lys Arg Gin Pro Ala Thr Leu Thr Val Asp Trp Phe 
2870 2875 2880 



Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe Ser Glu Ala Ser 
2885 2890 2895 



Pro Val Glu Leu Lys Leu Thr Gly lie Tyr Lys Lys Glu Glu Ala 
2900 2905 " 2910 



His Leu Leu Leu Lys Ala Phe Gin lie Lys Gly Gin Ala Asp lie 
2915 2920 2925 



Phe Val Ser Lys Phe Glu Asn Asp Asn Trp Gly Leu Asp Gly Tyr 
2930 2935 2940 



Val Ser Ser Gly Leu Glu Arg Gly Gly Phe Thr Phe Gin Gly Asp 
2945 2950 2955 



lie His Gly Lys Asp Phe Gly Lys Phe Lys Leu Glu Arg Gin Asp 
2960 2965 2970 



Pro Leu Asn Pro Asp Gin Asp Ser Ser Ser His Tyr His Gly Thr 
2975 2980 2985 



Ser Ser Gly Ser Val Ala Ala Ala lie Leu Val Pro Phe Phe Ala 
2990 2995 3000 



Leu He Leu Ser Gly Phe Ala Phe Tyr Leu Tyr Lys His Arg Thr 
3005 3010 3015 



Arg Pro Lys Val Gin Tyr Asn Gly Tyr Ala Gly His Glu Asn Ser 
3020 3025 ^ 3030 



Asn Gly Gin Ala Ser Phe Glu Asn Pro Met Tyr Asp Thr Asn Leu 
3035 3040 3045 



WO 02/10199 



PCT/US01/23232 



-28- 

Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp Thr Thr Leu Asn 
3050 " 3055 3060 

Thr Val Cys Thr Val Val 
3065 

<210> 3 

<211> 12525 

<212> DNA 

<213> . Rattus rattus 



<220> 

<r221> CDS 

<222> (1) . . (9285) 

<223> N = any amino acid 



<400> 3 

gat gcc ggg aag gtg ggg gac acc aga tec gtc ttg tac gtg ctt aca 48 
Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val Leu Tyr Val Leu Thr 
1 5 10 15 

ggc tec agt gtc cct gac etc ate gtg age atg age aat cag atg tgg 96 
Gly Ser Ser Val Pro Asp Leu lie Val Ser Met Ser Asn Gin Met Trp 
20 25 30 

etc cac ctg cag tea gac gac age att ggt tec cca gga ttt aaa get 144 
Leu His Leu Gin Ser Asp Asp Ser lie Gly Ser Pro Gly Phe Lys Ala 
35 40 45 

gtg tac caa gaa ate gag aag gga ggc tgc ggg gac cct ggc ate cca 192 
Val Tyr Gin Glu lie Glu Lys Gly Gly Cys Gly Asp Pro Gly lie Pro 
50 55 60 

gcc tac ggg aag egg act ggc age age ttc ttg cac ggg gac acg etc 240 
Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu His Gly Asp Thr Leu 
65 70 75 80 

acc ttt gag tgc cag gca get ttt gag ctg gta gga gag aga gtg att 288 
Thr Phe Glu Cys Gin Ala Ala Phe Glu Leu Val Gly Glu Arg Val lie 
85 90 95 

acg tgc cag aga aac aac cag tgg tec ggc aac aag cca age tgt gtg 336 
Thr Cys Gin Arg Asn Asn Gin Trp Ser Gly Asn Lys Pro Ser Cys Val 
100 105 110 

ttt tea tgt ttc ttc aac ttc acg gcg tec tct ggg ate ate ctg teg 384 
Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser Gly lie lie Leu Ser 
115 120 125 
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cca aac tat cct gag gaa tat ggc aac aac atg aat tgt gtg tgg ttg 432 
Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met Asn Cys Val Trp Leu 
130 135 140 

att ata tct gag ccc ggg age egg att cac etc ate ttc aat gat ttc 480 
lie lie Ser Glu Pro Gly Ser Arg He His Leu He Phe Asn Asp Phe 
145 150 155 160 

gat gtg gag cct cag ttt gac ttc ctt gcg gtc aaa gat gat ggg att 528 
Asp Val Glu Pro Gin Phe Asp Phe Leu Ala Val Lys Asp Asp Gly He 
165 170 ~ 175 

tct gac ate aca gtc etc ggg act ttc tct ggc aat gag gtg cct gca 576 
Ser Asp He Thr Val Leu Gly Thr Phe Ser Gly Asn Glu Val Pro Ala 
180 185 190 

cag ctg gec ngc agt gga cac ata gta cgc ctg gag ttt cag tec gat 624 
Gin Leu Ala Xaa Ser Gly His lie Val Arg Leu Glu Phe Gin Ser Asp 
195 200 205 

cac tct ace acg ggc aga ggg ttc aac ate ata tac ace aca ttt ggt 672 
His Ser Thr Thr Gly Arg Gly Phe Asn He He Tyr Thr Thr Phe Gly 
210 215 220 

cag aac gag tgt cat gac cct ggg ate cct gtg aat gga egg cgc ttt 720 
Gin Asn Glu Cys His Asp Pro Gly He Pro Val Asn Gly Arg Arg Phe 
225 230 235 ~ 240 

gga gac agg ttt ctg ctg gga agt tct gtg tec % ttc cac tgt gat gat 768 
Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser Phe His Cys Asp Asp 
245 250 255 

ggc ttt gtg aag act cag ggt tct gag tct ate aca tgc ate ttg caa 816 
Gly Phe Val Lys Thr Gin Gly Ser Glu Ser He Thr Cys He Leu Gin 
260 265 270 

gat gga aac gtg gtc tgg age tct act gtc cct cgc tgt gaa get cct 864 
Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro Arg Cys Glu Ala Pro 
275 280 285 

tgt ggt ggg cat ctg aca get tct agt ggg gtc ata tta cct cca gga 912 
Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val He Leu Pro Pro Gly 
290 295 300 

tgg cca gga tat tac aaa gat tct tta aat tgc gaa tgg gtc att gaa 960 
Trp Pro Gly Tyr Tyr Lys Asp Ser Leu Asn Cys Glu Trp Val He Glu 
305 310 315 320 

gee aaa cca gga cat tec ate aaa ata aca ttt gac agg ttc cag aca 1008 
Ala Lys Pro Gly His Ser He Lys He Thr Phe Asp Arg Phe Gin Thr 
325 330 335 

gaa gtc aat tat gat act ctg gaa gtc egg gat ggg cca ace age tea 1056 
Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp Gly Pro Thr Ser Ser 
340 345 350 

tec cca ctg att ggg gag tac cat ggc ace cag get cca cag ttc etc 1104 
Ser Pro Leu He Gly Glu Tyr His Gly Thr Gin Ala Pro Gin Phe Leu 
355 360 365 

ate age aca ggg aac tac atg tac ctg ctg ttt ace act gac age age 1152 
He Ser Thr Gly Asn Tyr Met Tyr Leu Leu Phe Thr Thr Asp Ser Ser 
370 375 380 
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cgc get agt gtt ggc ttc etc ate cac tat gag agt gtg act ctt gaa 1200 
Arg Ala Ser Val Gly Phe Leu lie His Tyr Glu Ser Val Thr Leu Glu 
385 390 395 400 

tct gac tec tgt ctg gac ccg ggc ate cct gta aat ggt cat egg cat 1248 
Ser Asp Ser Cys Leu Asp Pro Gly lie Pro Val Asn Gly His Arg His 
405 410 415 

ggc agt aac ttt ggt ate aga tct aca gtg acc ttc age tgt gac cct 1296 
Gly Ser Asn Phe Gly lie Arg Ser Thr Val Thr Phe Ser Cys Asp Pro 
420 425 430 

ggg tac acg etc agt gat gac gat ccc etc ate tgt gag aag aac cat 13.44 
Gly Tyr Thr Leu Ser Asp Asp Asp Pro Leu lie Cys Glu Lys Asn His 
435 -* 440 445 

cag tgg aac cac gee ttg ccc age tgt gat gee ctg tgt gga ggc tac 1392 
Gin Trp Asn His Ala Leu Pro Ser Cys Asp Ala Leu Cys Gly Gly Tyr 
450 455 460 

ate cat gga aag agt ggg act gtt ctt tea cca gga ttt cca gac ttt 1440 
lie His Gly Lys Ser Gly Thr Val Leu Ser Pro Gly Phe Pro Asp Phe 
465 470 475 480 

tat cca aac tct ctg aac tgt aca tgg acc att gaa gtc tct cat ggc 1488 
Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr He Glu Val Ser His Gly 
485 490 495 

aag gga gtg cag atg aat ttc cac acc ttt cac ctt gaa agt tec cac 1536 
Lys Gly Val Gin Met Asn Phe His Thr Phe His Leu Glu Ser Ser His 
500 505 510 

gac tat ttg ctg ate aca gag gat ggg agt ttc tea gag ccg gta gee 1584 
Asp Tyr Leu Leu He Thr Glu Asp Gly Ser Phe Ser Glu Pro Val Ala 
515 520 525 

agg etc act ggg teg gtc ctg cct cac acc att aag get ggc ttg ttt 1632 
Arg Leu Thr Gly Ser Val Leu Pro His Thr He Lys Ala Gly Leu Phe 
530 535 540 

gga aac ttc act gcg caa etc agg ttc ate tct gac ttc tec ate tec 1680 
Gly Asn Phe Thr Ala Gin Leu Arg Phe He Ser Asp Phe Ser He Ser 
545 550 555 560 

tat gaa ggc ttc aac att acg ttt gca gaa tat gac eta gaa ccc tgt 1728 
Tyr Glu Gly Phe Asn He Thr Phe Ala Glu Tyr Asp Leu Glu Pro Cys 
565 570 575 

gat gac cct gga gtc cct gee tac agt cgc aga att ggg ttc cag ttc 1776 
Asp Asp Pro Gly Val Pro Ala Tyr Ser Arg Arg He Gly Phe Gin Phe 
580 585 590 

ggt gtg ggt gac acc ctg get ttc acc tgc ttc cag gga tac cgc tta 1824 
Gly Val Gly Asp Thr Leu Ala Phe Thr Cys Phe Gin Gly Tyr Arg Leu 
595 600 605 

gaa ggt gca acc aag ctt acc tgc ctg ggt ggg gga cgc cga gtg tgg 1872 
Glu Gly Ala Thr Lys Leu Thr Cys Leu Gly Gly Gly Arg Arg Val Trp 
610 615 620 

agt gca cct ctg cca agg tgt gtg get gaa tgt gga gca age gtc aaa 1920 
Ser Ala Pro Leu Pro Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys 
625 630 635 640 
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gga aat gaa gga aca tta etc tct cca aat ttc cca tec aat tat gat 1968 

Gly Asn Glu Gly Thr Leu lieu Ser Pro Asn Phe Pro Ser Asn Tyr Asp 
645 650 655 

aat aac cat gag tgt ate tat aaa ata gaa aca gaa gec gga aag ggg 2016 
Asn Asn His Glu Cys lie Tyr Lys lie Glu Thr Glu Ala Gly Lys Gly 
660 665 670 

ate cat etc aga gee cga ace ttc caa etc ttc gaa gga gac act eta 2064 
lie His Leu Arg Ala Arg Thr Phe Gin Leu Phe Glu Gly Asp Thr Leu 
675 680 685 

aag gtt tat gat gga aag gac age tec teg agg tea ctg gga gtc ttc 2112 
Lys Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Ser Leu Gly Val Phe 
690 695 700 

aca aga agt gaa ctg atg ggg ctg gtg eta aac age acc tec aac cac 2160 
Thr Arg Ser Glu Leu Met Gly Leu Val Leu Asn Ser Thr Ser Asn His 
705 710 715 720 

ctg agg ctg gag ttc aac tct aac ggg tea gat acc gee caa ggc ttc 2208 
Leu Arg Leu Glu Phe Asn Ser Asn Gly Ser Asp Thr Ala Gin Gly Phe 
725 730 735 

cag etc acc tac acc agt ttt gac eta gtg aaa tgt gag gat cca ggc 2256 
Gin Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys Cys Glu Asp Pro Gly 
740 745 750 

ate cct aac tat ggc tac agg ate cga gat gat ggt cac ttc aca gac 2304 
lie Pro Asn Tyr Gly Tyr Arg lie Arg Asp Asp Gly His Phe Thr Asp 
755 760 765 

act gtg gtt etc tac age tgc aac cca ggc tac gca atg cat ggc age 2352 
Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr Ala Met His Gly Ser 
770 775 780 

agt acc ctg acc tgc ctg agt ggg gac cga agg gtg tgg gac aaa cct 2400 
Ser Thr Leu Thr Cys Leu Ser Gly Asp Arg Arg Val Trp Asp Lys Pro 
.785 790 795 * " 800 

atg cct tec tgt gtg gcg gaa tgt ggt ggt etc gtc cat gca gee aca 2448 
Met Pro Ser Cys Val Ala Glu Cys Gly Gly Leu Val His Ala Ala Thr 
805 810 815 

tea gga cgc ata etc tct cct ggc tac cct gee cca tat gac aac aac 2496 
Ser Gly Arg lie Leu Ser Pro Gly Tyr Pro Ala Pro Tyr Asp Asn Asn 
820 825 830 

ctt cat tgc act tgg acc ata gag get gat cct ggc aag acc aye age 2544 
Leu His Cys Thr Trp Thr lie Glu Ala Asp Pro Gly Lys Thr Xaa Ser 
835 840 845 

etc cat ttc att gtg ttt gac act gaa acg gcg cac gac ate etc aag 2592 
Leu His Phe lie Val Phe Asp Thr Glu Thr Ala His Asp lie Leu Lys 
850 855 860 

gtc tgg gat ggt cca gtg gac age aac ate ctg ctg aag gag tgg age 2640 
Val Trp Asp Gly Pro Val Asp Ser Asn lie Leu Leu Lys Glu Trp Ser 
865 870 875 880 

ggc teg gee ctt cct gag gac ate cac age acc ttc aac teg etc acc 2688 
Gly Ser Ala Leu Pro Glu Asp lie His Ser Thr Phe Asn Ser Leu Thr 
885 890 895 
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ctg cag ttc gat agt gac ttc ttc ate age aag tec ggc ttc tec ate 
;Leu Gin Phe Asp Ser Asp Phe Phe lie Ser Lys Ser Gly Phe Ser lie 
900 905 910 



2736 



cag ttc tct act tec att gca tec ace tgc aat gac cct ggg atg cct 
Gin Phe Ser Thr Ser lie Ala Ser Thr Cys Asn Asp Pro Gly Met Pro 
915 920 925 



2784 



cag aat gga acc cgc tat ggt gac age egg gaa cct gga gac ace ate 
Gin Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu Pro Gly Asp Thr He 
930 935 940 



2832 



acc ttc cag tgt gac cct gga tac cag etc caa ggg caa gec aag ate 2880 
Thr Phe Gin Cys Asp Pro Gly Tyr Gin Leu Gin Gly Gin Ala Lys He 
945 , 950 955 960 

act tgt gtg cag ctt aac aac cgc ttc ttc tgg caa cca gac cct ccg 2928 
Thr Cys Val Gin Leu Asn Asn Arg Phe Phe Trp Gin Pro Asp Pro Pro 
965 970 975 

tea tgc ata get get tgt ggt ggg aat ctg aca ggc cct get gga gtg 2976 
Ser Cys He Ala Ala Cys Gly Gly Asn Leu Thr Gly Pro Ala Gly Val 
980 985 990 



att tta tec cca aac tac cca cag cca tac cct cct ggg aag gag tgt 
He Leu Ser Pro Asn Tyr Pro Gin Pro Tyr Pro Pro Gly Lys Glu Cys 
995 1000 1005 



gac tgg aga att aag gtg aac cca gac ttt gtc att 
Asp Trp Arg He Lys Val Asn Pro Asp Phe Val He 
1010 1015 1020 



gee tta ata 
Ala Leu He 



3024 



3069 



ttc aaa agt ttt age atg gag cca agt tac gac ttc ctg cat ate 

Phe Lys Ser Phe Ser Met Glu Pro Ser Tyr Asp Phe Leu His He 
1025 1030 1035 

tat gaa ggg aag gac tec aac age cca ctg ate gga age ttc cag 

Tyr Glu Gly Lys Asp Ser Asn Ser Pro Leu He Gly Ser Phe Gin 
1040 1045 1050 



3114 



3159 



ggt tct caa gec cca gag agg att gag age agt ggt aac age etc 
Gly Ser Gin Ala Pro Glu Arg He Glu Ser Ser Gly Asn Ser Leu 
1055 1060 1065 



ttc ctg gca ttc agg agt gat gee tct gtt ggc ctg 
Phe Leu Ala Phe Arg Ser Asp Ala Ser Val Gly Leu 
1070 1075 1080 



tec ggg ttt 
Ser Gly Phe 



3204 



3249 



gee att gaa ttt aaa gag aaa cca egg gaa get tgc 
Ala He Glu Phe Lys Glu Lys Pro Arg Glu Ala Cys 
1085 , 1090 ~ 1095 



ttt gac cct 
Phe Asp Pro 



3294 



ggg aac ata atg aac ggg aca agg att gga acg gac ttt aag ctg 
Gly Asn He Met Asn Gly Thr Arg He Gly Thr Asp Phe Lys Leu 
1100 1105 1110 



3339 



ggc tct aca gtt acc tat caa tgt gac tct ggt tac 

Gly Ser Thr Val Thr Tyr Gin Cys Asp Ser Gly Tyr 
1115 1120 ~ 1125 

gat ccc tea tec att gag tgt gtg aca ggg get gat 

Asp Pro Ser Ser He Glu Cys Val Thr Gly Ala Asp 
1130 1135 * 1140 



aag att gtg 
Lys He Val 



ggg aag ccg 
Gly Lys Pro 



3384 



3429 
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tcc tgg gac egg gca ctg cct gec tgc caa gca ccc tgt gga ggc 3474 
Ser Trp Asp Arg Ala Leu Pro Ala Cys Gin Ala Pro Cys Gly Gly 
1145 1150 1155 

caa tac atg ggc teg gag ggg gta gtt ttg tea cca aac tac cct 3519 
Gin Tyr Met Gly Ser Glu Gly Val Val Leu Ser Pro Asn Tyr Pro 
. 1160 1165 1170 

cat aac tac acg get ggg cag at a tgc ate tat tec ate acg gtg 3564 
His Asn Tyr Thr.Ala Gly Gin He Cys He Tyr Ser He Thr Val 
1175 1180 1185 

ccc aag gaa ttt gtg gtg ttt gga cag ttt gec tat ttc cag act 3609 
Pro Lys Glu Phe Val Val Phe Gly Gin Phe Ala Tyr Phe Gin Thr 
1190 1195 1200 

gcg ctg aac gac ttg gca gaa ttg ttt gat gga acc cat cct cag 3654 
Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp Gly Thr His Pro Gin 
1205 1210 1215 

gee agg ctt etc agt tct etc tct ggt tec cat tea ggt gaa aca 3699 
Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser His Ser Gly Glu Thr 
1220 1225 1230 

etc ccg ctg get aca tec aat cag att ctg ctt cgc ttc age gca 3744 
Leu Pro Leu Ala Thr Ser Asn Gin He Leu Leu Arg Phe Ser Ala 
1235 1240 1245 

aag age gga get tct gca egg ggt ttc cac ttc gtc tac caa gee 3789 
Lys Ser Gly Ala Ser Ala Arg Gly Phe His Phe Val Tyr Gin Ala 
1250 1255 1260 

gtc cca cgc acc agt gac acg cag tgc age tec gtc cct gag ccc 3834 
Val Pro Arg Thr Ser Asp Thr Gin Cys Ser Ser Val Pro Glu Pro 
1265 1270 1275 

aga tat ggg aga agg att ggt tct gag ttc tct gca ggc tec ate 3879 
Arg Tyr Gly Arg Arg He Gly Ser Glu Phe Ser Ala Gly Ser lie 
1280 1285 1290 

gtc cga ttc gag tgc aac cca ggt tac ctg ctg caa ggc tec aca 3924 
Val Arg Phe Glu Cys Asn Pro Gly Tyr Leu Leu Gin Gly Ser Thr 
1295 * 1300 1305 

gec ate cgt tgt cag tct gtg cca aac get ttg gec cag tgg aat 3969 
Ala He Arg Cys Gin Ser Val Pro Asn Ala Leu Ala Gin Trp Asn 
1310 1315 1320 

gac acc ate eca age tgt gta gtt cca tgc agt ggc aat ttc act 4014 
Asp Thr He Pro Ser Cys Val Val Pro Cys Ser Gly Asn Phe Thr 
1325 1330 1335 

cag aga aga ggg aca ate tta tct cca ggc tac cct gag ccc tat 4059 
Gin Arg Arg Gly Thr He Leu Ser Pro Gly Tyr Pro Glu Pro Tyr 
1340 1345 1350 

ggg aac aac ctg aac tgt gta tgg aag ate ata gta teg gag ggc 4104 
Gly Asn Asn Leu Asn Cys Val Trp Lys He He Val Ser Glu Gly 
1355 1360 1365 

tea ggg ate cag ate caa gtg att age ttt gec acg gag cag aac 414 9 

Ser Gly He Gin He Gin Val He Ser Phe Ala Thr Glu Gin Asn 
1370 1375 1380 
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tgg gac tec ctg gag ate cat gac gga gga gac atg acg gec ccc 
Trp Asp Ser Leu Glu He His Asp Gly Gly Asp Met Thr Ala Pro 
1385 1390 1395 



4194 



aga ctg ggc age ttc tea ggt ace aca gtg ccc gca ctg ctg aat 
Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro Ala Leu Leu Asn 
1400 1405 1410 



4239 



age acc tec aac cag etc tgc ctg cac ttc cag teg 
Ser Thr Ser Asn Gin Leu Cys Leu His Phe Gin Ser 
1415 1420 1425 



gac ate agt 
Asp He Ser 



4284 



gtt gee get gcg ggc ttt cac 
Val Ala Ala Ala Gly Phe His 
1430 1435 



ctg gaa tac aaa acg 
Leu Glu Tyr Lys Thr 
1440 



gtg ggt ctg 
Val Gly Leu 



4329 



get gcg tgc cag gaa cct get etc ccg age aac ggc ate aag at a 
Ala Ala Cys Gin Glu Pro Ala Leu Pro Ser Asn Gly He Lys He 
1445 1450 1455 



4374 



gga gac cgc tat atg gtg aac gat gtg ctg tec ttc 
Gly Asp Arg Tyr Met Val Asn Asp Val Leu Ser Phe 
1460 ~ 1465 1470 



cag tgc gag 
Gin Cys Glu 



4419 



cct ggg tac acc ttg cag ggc cgc tea cac att tct tgt atg ccg 
Pro Gly Tyr Thr Leu Gin Gly Arg Ser His He Ser Cys Met Pro 
1475 1480 1485 



4464 



gga act gta cgt cgc tgg aac tat cct tec cct ctg tgc att gee 
Gly Thr Val Arg Arg Trp Asn Tyr Pro Ser Pro Leu Cys He Ala 
1490 ~ 1495 1500 



4509 



acc tgt ggt ggg aca ctg acc 
Thr Cys Gly Gly Thr Leu Thr 
1505 1510 



age atg agt gga gtg 
Ser Met Ser Gly Val 
1515 



ate ctg age 
He Leu Ser 



4554 



cca ggc ttc cca ggg tea tac ccc aac aac ctg gac tgc acc tgg 
Pro Gly Phe Pro Gly Ser Tyr Pro Asn Asn Leu Asp Cys Thr Trp 
1520 " 1525 1530 



4599 



aag ata tec ctg ccc att ggc tat ggt gca cat ate 
Lys He Ser Leu Pro He Gly Tyr Gly Ala His He 
1535 1540 1545 



caa ttt ctg 
Gin Phe Leu 



4644 



aat ttc tea act gaa gee aac cat gac tac ctg gag ate cag aat 
Asn Phe Ser Thr Glu Ala Asn His Asp Tyr Leu Glu He Gin Asn 
1550 1555 " 1560 



4689 



ggc cct tac cac agt agt cca atg atg gga cag ttc agt ggc cct 
Gly Pro Tyr His Ser Ser Pro Met Met Gly Gin Phe Ser Gly Pro 
1565 1570 1575 



4734 



gac ctg cct gcg tea ctg ctg 
Asp Leu Pro Ala Ser Leu Leu 
1580 1585 



age acc aca cat gaa 
Ser Thr Thr His Glu 
1590 



acc etc ate 
Thr Leu He 



4779 



cgc ttc tat agt gac cac tea cag aac cga caa gga ttt aaa etc 
Arg Phe Tyr Ser Asp His Ser Gin Asn Arg Gin Gly Phe Lys Leu 
1595 1600 1605 



4824 



agt tac caa get tat gag tta 
Ser Tyr Gin Ala Tyr Glu Leu 
1610 1615 



cag aac tgc ccg gac cca ccc gca 
Gin Asn Cys Pro Asp Pro Pro Ala 
1620 



4869 
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ttc cag aat ggg ttc atg ate aac tec gat tac age 
Phe Glji Asn Gly Phe Met lie Asn Ser Asp Tyr Ser 
1655 1630 1635 



gtg ggc cag 
Val Gly Gin 



4914 



teg ate tea ttt gag tgc tac 
Ser He Ser Phe Glu Cys Tyr 
1640 1645 



ccg ggc tac ate ttg 
Pro Gly Tyr He Leu 
.1650 



eta ggc cac 
Leu Gly His 



4959 



cet gtg etc ace tgc cag cat ggc act gac agg aac 
Pro Val Leu Thr Cys Gin His Gly Thr Asp Arg Asn 
1655 1660 1665 



tgg aac tac 
Trp Asn Tyr 



5004 



cet ttc cca egg tgt gac get 
Pro Phe Pro Arg Cys Asp Ala 
1670 1675 



ccc tgt ggg tat aat gtg aca tea 
Pro Cys Gly Tyr Asn Val Thr Ser 
1680 



5049 



cag aat ggc acc att tat tec cet ggg ttc cca gac gag tat cca 
Gin Asn Gly Thr He Tyr Ser Pro Gly Phe Pro Asp Glu Tyr Pro 
1685 1690 1695 



5094 



att ctg aag gac tgc ctg tgg 
lie Leu Lys Asp Cys Leu Trp 
1700 1705 



ctg gtc act gtc cet 
Leu Val Thr Val Pro 
1710 



cca gga cat 
Pro Gly His 



5139 



gg a gtg tac ate aac ttc acc 
Gly Val Tyr He Asn Phe Thr 
1715 172 0 



ttg ctg cag act gag get gta aat 
Leu Leu Gin Thr Glu Ala Val Asn 
1725 



5184 



gac tac ate get gtg tgg gat 
Asp Tyr He Ala Val Trp Asp 
1730 1735 



ggt cet gac cag aac 
Gly Pro Asp Gin Asn 
1740 



teg cet cag 
Ser Pro Gin 



5229 



etc ggg gtc ttc agt gga aac act gee etc gag aca 
Leu Gly Val Phe Ser Gly Asn Thr Ala Leu Glu Thr 
1745 1750 1755 



gca tac age 
Ala Tyr Ser 



5274 



tec acc aac cag gtc ttg etc 
Ser Thr Asn Gin Val Leu Leu 
1760 1765 



aaa ttc cac age gat 
Lys Phe His Ser Asp 
1770 



ttc tec aat 5319 
Phe Ser Asn 



gga ggc ttc ttt gtc etc aat ttt cat gca ttt caa 
Gly Gly Phe Phe Val Leu Asn Phe His Ala Phe Gin 
1775 1780 1785 



ctg aag agg 
Leu Lys Arg 



5364 



tgc ccg cet cet cca gta gtg 
Cys Pro Pro Pro Pro Val Val 
1790 1795 



ccg cag get gac ctg ctt aca gaa 
Pro Gin Ala Asp Leu Leu Thr Glu 
1800 



5409 



gat gaa gac ttt gaa ata ggg gac ttc gta aag tac 
Asp Glu Asp Phe Glu He Gly Asp Phe Val Lys Tyr 
1805 1810 ** 1815 



cag tgc cat 
Gin Cys His 



5454 



cca ggg tac acg ctg ttg gga 
Pro Gly Tyr Thr Leu Leu Gly 
1820 1825 



agt gac acc ctg aca tgc aag etc 
Ser Asp Thr Leu Thr Cys Lys Leu 
1830 



5499 



age tea cag eta ttg ttc caa ggc tct cca cet acc 
Ser Ser Gin Leu Leu Phe Gin Gly Ser Pro Pro Thr 
1835 1840 ' 1845 



tgt gaa gca 
Cys Glu Ala 



5544 



caa tgc cca gee aat gaa gtg cga aca gag tct tct ggg gtg att 
Gin Cys Pro Ala Asn Glu Val Arg Thr Glu Ser Ser Gly Val He 
1850 1855 ~ 1860 



5589 
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etc agt cct ggg tac cca 
Leu Ser Pro Gly Tyr Pro 
1865 

get tgg agt att aaa gtg 
Ala Trp Ser lie Lys Val 
1880 

gtg gac acc ttt caa agt 
Val Asp Thr Phe Gin Ser 
1895 

ttt gat ggt tct tct ggg 
Phe Asp Gly Ser Ser Gly 
1910 

ggg aac cac act gaa cag 
Gly Asn His Thr Glu Gin 
1925 

ctg tac etc cgc tgg tec 
Leu Tyr Leu Arg Trp Ser 
1940 

ttc aag att cgc tat gca 
Phe Lys lie Arg Tyr Ala 
1955 

etc aag aat ggt ggc gtt 
Leu Lys Asn Gly Gly Val 
1970 

age aag gtg cag tat ttc 
Ser Lys Val Gin Tyr Phe 
1985 

cac age aac gee acc tgc 
His Ser Asn Ala Thr Cys 
2000 

t-99 9 a c teg atg gca ccg 
Trp Asp Ser Met Ala Pro 
2015 

cca gag get cca gga aat 
Pro Glu Ala Pro Gly Asn 
2030 

tta gac agt aaa gtg act 
Leu Asp Ser Lys Val Thr 
2045 

gat gee agt cag caa gee 
Asp Ala Ser Gin Gin Ala 
2060 

tgg age aac aga gga aag 
Trp Ser Asn Arg Gly Lys 
2075 

ccc age ate gaa ggc cag 
Pro Ser lie Glu Gly Gin 
2090 



-36- 

ggc aac tat ttt aac 
Gly Asn Tyr Phe Asn 
1870 

gag cca aac ttt aac 
Glu Pro Asn Phe Asn 
1885 

gaa aag caa ttt gat 
Glu Lys Gin Phe Asp 
1900 

caa agt cct ttg tta 
Gin Ser Pro Leu Leu 
1915 

tec aat ttt acc age 
Ser Asn Phe Thr Ser 
1930 

aca gat cat gca acc 
Thr Asp His Ala Thr 
1945 

get cct tac tgc age 
Ala Pro Tyr Cys Ser 
1960 

tta aat aaa acc gca 
Leu Asn Lys Thr Ala 
1975 

tgc aag cct gga tat 
Cys Lys Pro Gly Tyr 
1990 

agg egg aac cca gtg 
Arg Arg Asn Pro Val 
2005 

ctt tgc cag get gtg 
Leu Cys Gin Ala Val 
2020 

ggc teg ttc aca ggc 
Gly Ser Phe Thr Gly 
2035 

tat gaa tgt aat gaa 
Tyr Glu Cys Asn Glu 
2050 

act get gtg tgt caa 
Thr Ala Val Cys Gin 
2065 

cca ccc acg tgc aaa 
Pro Pro Thr Cys Lys 
2080 

ctg tea gag cac gtg 
Leu Ser Glu His Val 
2095 



tec cag aca tgt 5634 

Ser Gin Thr Cys 

1875 

att acg etc ttt 5679 

lie Thr Leu Phe 

1890 

gca ctg gaa gta 5724 

Ala Leu Glu Val 

1905 

gtg gtc tta agt 5769 

Val Val Leu Ser 

1920 

aga agt aac cat 5814 

Arg Ser Asn His 

1935 

age aag aaa gga 5859 

Ser Lys Lys Gly 

1950 

etc acc tct aca 5904 

Leu Thr Ser Thr 

1965 

ggc gee ctg ggg 5949 

Gly Ala Leu Gly 

1980 

cga atg att ggc 5994 

Arg Met lie Gly 

1995 

ggc gtg tac cag 6039 

Gly Val Tyr Gin 

2010 

tec tgt gga att 6084 

Ser Cys Gly lie 

2025 

aat gag ttc acc 6129 

Asn Glu Phe Thr 

2040 

ggc ttc aag ctg 6174 

Gly. Phe Lys Leu 

2055 

gaa gat ggc ctg 6219 

Glu Asp Gly Leu 

2070 

ccg gtg ccc tgc 6264 

Pro Val Pro Cys 

2085 

etc tgg agg ctg 6309 

Leu Trp Arg Leu 

2100 
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gtt teg gga tea ttg aat gaa tat gga get caa gtt etc etc age 6354 

Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala Gin Val Leu Leu Ser 
2l65 2110 2115 

tgt agt cct ggc tac ttc ttg cag ggt cag agg ctg ttg cag tgc 6399 

Cys Ser Pro Gly Tyr Phe Leu Gin Gly Gin Arg Leu Leu Gin Cys 
2120 " 2125 2130 

* ••• 1 

caa gee aat ggg ace tgg aac act gag gag gac aga ccc aga tgt 6444 

Gin Ala Asn Gly Thr Trp Asn Thr Glu Glu Asp Arg Pro Arg Cys 
2135 2140 2145 

aaa gtc ate tec tgt gga age ctg tec ttt ccc cca aat ggt aac 6489 
Lys Val lie Ser Cys Gly Ser Leu Ser Phe Pro Pro Asn Gly Asn 

2150 2155 2160 

*. 

aag ata ggg acg etc act atg tat gga gec ace gee ate ttt acc 6534 

Lys lie Gly Thr Leu Thr Met Tyr Gly Ala Thr Ala lie Phe Thr 
2165 2170 2175 

tgc aat acc ggc tac aca ctt gta ggc tec cat gtc egg gag tgc 6579 
Cys Asn Thr Gly Tyr Thr Leu Val Gly Ser His Val Arg Glu Cys 
2180 2185 2190 

ttg gee aat ggt etc tgg age gga tct gaa aca agg tgc ctg gcg 6624 
Leu Ala Asn Gly Leu Trp Ser. Gly Ser Glu Thr Arg Cys Leu Ala 
2195 2200 2205 

ggt cat tgt ggc tct cca gac ccc att gtg aat ggc cat ate agt 6669 
Gly His Cys Gly Ser Pro Asp Pro lie Val Asn Gly His lie Ser 
2210 2215 2220 

ggc gat ggc ttc age tac agg gac aca gtg gtc tac caa tgc aac 6714 
Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val Val Tyr Gin Cys Asn 
2225 2230 2235 

cct ggg ttt cga etc gta ggc acg tct gtg agg att tgc ctg cag 6759 
Pro Gly Phe Arg Leu Val Gly Thr Ser Val Arg lie Cys Leu Gin 
2240 ~ 2245 2250 

gac cac aag tgg teg ggg cag acc ccc gtt tgc gtc ccc ate aca 6804 
Asp His Lys Trp Ser Gly Gin Thr Pro Val Cys Val Pro He Thr 
2255 2260 2265 

tgt gga cac cct gga aac cct gee cat ggc etc acc aac ggc age 6849 
Cys Gly His Pro Gly Asn Pro Ala His Gly Leu Thr Asn Gly Ser 
2270 2275 2280 

gag ttc aac ctg aat gac ctt gtg aat ttc acc tgc cat acg ggc 6894 
Glu Phe Asn Leu Asn Asp Leu Val Asn Phe Thr Cys His Thr Gly 
2285 2290 2295 

tac ctg ctg cag ggt gee tec cga gec caa tgt egg age aac ggc 6939 
Tyr Leu Leu Gin Gly Ala Ser Arg Ala Gin Cys Arg Ser Asn Gly 
2300 2305 2310 

cag tgg age age ccc ttg cct ate tgc cga gtg gtg aac tgt tec 6984 
Gin Trp Ser Ser Pro Leu Pro He Cys Arg Val Val Asn Cys Ser 
2315 2320 2325 

gat cct gga ttt gtg gaa aat gca gtt cgc cac ggg caa cag aac 7029 
Asp Pro Gly Phe Val Glu Asn Ala Val Arg His Gly Gin Gin Asn 
2330 2335 2340 
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ttt cca gag agt ttc gag tat ggg aca agt gtg atg 
Phe Prp Glu Ser Phe Glu Tyr Gly Thr Ser Val Met 
2345 2350 2355 



tat cac tgc 
Tyr His Cys 



7074 



aag aag ggg ttc tac eta ctg ggc tct tct gec ctg 
Lys Lys Gly Phe Tyr Leu Leu Gly Ser Ser Ala Leu 
2360 2365 2370 



acc tgc atg 
Thr Cys Met 



7119 



gca agt ggc ttg tgg gac cgc tec tta ccc aag tgt 
Ala Ser Gly Leu Trp Asp Arg Ser Leu Pro Lys Cys 
2375 2380 2385 



ctg get ata 
Leu Ala lie 



7164 



tea tgt ggg cat cct ggg gtc 
Ser Cys Gly His Pro Gly Val 
2390 2395 



ccc get aat get gtc ctg act gga 
Pro Ala Asn Ala Val Leu Thr Gly 
2400 



7209 



gaa ttg ttt aca ttt gga gec aca gtt cag tac tec 
Glu Leu Phe Thr Phe Gly Ala Thr Val Gin Tyr Ser 
2405 2410 2415 



tgc aaa ggg 
Cys Lys Gly 



7254 



ggc cag att etc aca ggc aat age aca aga gtc tgc 
Gly Gin lie Leu Thr Gly Asn Ser Thr Arg Val Cys 
2420 2425 ^ 2430 



caa gaa gac 
Gin Glu Asp 



7299 



agt cac tgg agt gga tec ctt ccc cat tgt tea gga 
Ser His Trp Ser Gly Ser Leu Pro His Cys Ser Gly 
2435 2440 2445 



aat agt cct 
Asn Ser Pro 



7344 



gga ttt tgt ggt gat cca ggg 
Gly Phe Cys Gly Asp Pro Gly 
2450 2455 



acc cca gca cat ggg tct cgt ctt 
Thr Pro Ala His Gly Ser Arg Leu 
2460 



7389 



ggg gat gag ttt aag aca aag agt ctt ttg cga ttc 
Gly Asp Glu Phe Lys Thr Lys Ser Leu Leu Arg Phe 
2465 2470 2475 



tec tgt gag 
Ser Cys Glu 



7434 



at 9 ggc cac ca g ct 9 c 99 99 fc tct 9 ca 9 a 9 C 9 C aca 
Met Gly His Gin Leu Arg Gly Ser Ala Glu Arg Thr 
2480 2485 2490 



tgc ctg gtg 
Cys Leu Val 



7479 



aat ggg tec tgg tea gga gtc cag cct gtg tgt gag 
Asn Gly Ser Trp Ser Gly Val Gin Pro Val Cys Glu 
2495 2500 2505 



gee gtg tec 
Ala Val Ser 



7524 



tgt gga aac cct ggc acc cct 
Cys Gly Asn Pro Gly Thr Pro 
2510 2515 



acc aat ggg atg ate 
Thr Asn Gly Met lie 
2520 



etc age age 
Leu Ser Ser 



7569 



gat gga ate etc ttc tec age tct gtc ate tat gec 
Asp Gly lie Leu Phe Ser Ser Ser Val lie Tyr Ala 
2525 2530 2535 



tgc tgg gaa 
Cys Trp Glu 



7614 



ggc tac aag acc teg ggg etc 
Gly Tyr Lys Thr Ser Gly Leu 
2540 2545 



atg acg egg cac tgc aca gcg aac 
Met Thr Arg His Cys Thr Ala Asn 
2550 



7659 



ggg aca tgg aca ggc aca gec cct gac tgt aca ate 
Gly Thr Trp Thr Gly Thr Ala Pro Asp Cys Thr He 
2555 2560 2565 



ate age tgt 
He Ser Cys 



. 7704 



ggt gat cct ggc aca ctg ccc 
Gly Asp Pro Gly Thr Leu Pro 
2570 2575 



aa t 99C a tc cag ttt 
Asn Gly He Gin Phe 
2580 



999 aca gac 
Gly Thr Asp 



7749 
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ttc act ttc aac aag acc gtg age tat cag tgc aac 
Phe Thr Phe Asn Lys Thr Val Ser Tyr Gin Cys Asn 
2585 2590 2595 



cct ggc tac 
Pro Gly Tyr 



7794 



ctg atg gag ccc cca aca tea ccc acc ate cgc tgc 
Leu Met Glu Pro Pro Thr Ser Pro Thr lie Arg Cys 
2600 2605 2610 



acc aaa gat 
Thr Lys Asp 



7839 



ggt aca tgg aat cag acc egg ccc etc tgc aaa get gtt eta tgc 
Gly Thr Trp Asn . Gin Thr Arg Pro Leu Cys Lys Ala Val Leu Cys 
2615 2620 2625 



7884 



age cag cct ccc tea gtg cca aac gga aag gtg gag 
Ser Gin Pro Pro Ser Val Pro Asn Gly Lys Val Glu 
2630 2635 2640 



ggg tea gac 
Gly Ser Asp 



7929 



ttc cga . tgg ggt gee age ata age tac agt tgt gtg 
Phe Arg Trp Gly Ala Ser lie Ser Tyr Ser Cys Val 
2645 2650 2655 



gat ggc tac 
Asp Gly Tyr 



7974 



cag etc tec cac teg gee ate ctg tec tgt gaa ggg cgt gga gta 
Gin Leu Ser His Ser Ala lie Leu Ser Cys Glu Gly Arg Gly Val 
2660 2665 2670 



8019 



tgg aaa gga gaa gtc cct cag tgc ttg cct gtg ttc 
Trp Lys Gly Glu Val Pro Gin Cys Leu Pro Val Phe 
2675 2680 2685 



tgt ggc gat 
Cys Gly Asp 



8064 



cca ggc act cca gca gag gga egg etc agt ggg aaa age ttc acc 
Pro Gly Thr Pro Ala Glu Gly Arg Leu Ser Gly Lys Ser Phe Thr 
2690 2695 " 2700 



8109 



ttt aag tct gag gtc ttc ate cag tgc aaa ccc cca 
Phe Lys Ser Glu Val Phe lie Gin Cys Lys Pro Pro 
2705 2710 2715 



ttt gtg tta 
Phe Val Leu 



8154 



9^9 99t tec teg agg aga acc 
Val Gly Ser Ser Arg Arg Thr 
2720 2725 



tgc cag gee gat ggg 
Cys Gin Ala Asp Gly 
2730 



atg tgg agt 
Met Trp Ser 



8199 



ggc ate cag ccc act tgt ata gat cca gee cac acc 
Gly lie Gin Pro Thr Cys lie Asp Pro Ala His Thr 
2735 2740 2745 



get tgc cca 
Ala Cys Pro 



8244 



gac ccc ggc act ccc cac ttt gga ata cag aat age teg aaa gga 
Asp Pro Gly Thr Pro His Phe Gly lie Gin Asn Ser Ser Lys Gly 
2750 2755 2760 



8289 



tac gag gtt gga age act gtg ttc ttc aga tgt aga 
Tyr Glu Val Gly Ser Thr Val Phe Phe Arg Cys Arg 
2765 2770 2775 



aaa ggt tac 
Lys Gly Tyr 



8334 



cac ate caa ggc tec act acc egg acc tgt ctt gee aac etc acg 
His lie Gin Gly Ser Thr Thr Arg Thr Cys Leu Ala Asn Leu Thr 
2780 2785 " 2790 



8379 



tgg agt gga ate cag aca gag tgc ate ccc cat gee tgc egg cag 
Trp Ser Gly He Gin Thr Glu Cys He Pro His Ala Cys Arg Gin 
2795 2800 2805 



8424 



cca gag acc cca gcg cat gca gat gtg aga gee ate gat ctt cca 
Pro Glu Thr Pro Ala His Ala Asp Val Arg Ala He Asp Leu Pro 
2810 2815 2820 



8469 
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get ttt ggc tac acc tta 
Ala Phe Gly Tyr Thr Leu 
2825 

ctt get ggc gga tct gag 
Leu Ala Gly Gly Ser Glu 
. 2840 

tgg aca gga aag tea cct 
Trp Thr Gly Lys . Ser Pro 
2855 

gtt aat gaa aca gtt act 
Val Asn Glu Thr Val Thr 
2870 

ttc ate aac teg gtg tgg 
Phe lie Asn Ser Val Trp 
2885 

aga cag ccg gcg act etc 
Arg Gin Pro Ala Thr Leu 
2900 

age aag gtc aat gcg acc 
Ser Lys Val Asn Ala Thr 
2915 

gag ctg aca ggg gtc tac 
Glu Leu Thr Gly Val Tyr 
2930 

aaa gec ttt cat ate aaa 
Lys Ala Phe His lie Lys 
2945 

ttt gaa aat gac aac tgg 
Phe Glu Asn Asp Asn Trp 
2960 

ctt gag aga gga gga ttc 
Leu Glu Arg Gly Gly Phe 
2975 

gac ttc ggg aag ttc aag 
Asp Phe Gly Lys Phe Lys 
2990 

gat gca gat tct tea aat 
Asp Ala Asp Ser Ser Asn 
3005 

gtg gca get gcg att etc 
Val Ala Ala Ala lie Leu 
3020 

ggg ttt gca ttt tac etc 
Gly Phe Ala Phe Tyr Leu 
3035 

caa tac aat ggc tat get 
Gin Tyr Asn Gly Tyr Ala 
3050 



-40- 

gtc tac acc tgt cat 
Val Tyr Thr Cys His 
2830 

cac agg acg tgt aaa 
His Arg Thr Cys Lys 
2845 

gtt tgt aaa agt aaa 
Val Cys Lys Ser Lys 
2860 

aaa act cca gtt cct 
Lys Thr Pro Val Pro 
2875 

aag gga tat tat gaa 
Lys' Gly Tyr Tyr Glu 
2890 

act gtg gac tgg ttt 
Thr Val Asp Trp Phe 
2905 

ttc acc gca gee tea 
Phe Thr Ala Ala Ser 
2920 

aag aag gaa gag gee 
Lys Lys Glu Glu Ala 
2935 

ggc cca gca gat att 
Gly Pro Ala Asp lie 
2950 

gga etc gat ggt tat 
Gly Leu Asp Gly Tyr 
2965 

tec ttt cag ggt gat 
Ser Phe Gin Gly Asp 
2980 

ctg gaa aga caa gat 
Leu Glu Arg Gin Asp 
2995 

cat tac cag ggc acc 
His Tyr Gin Gly Thr 
3010 

gtc ccc ttc ttc get 
Val Pro Phe Phe Ala 
3025 

tac aaa cac aga aca 
Tyr Lys His Arg Thr 
3040 

ggc cat gaa aac agt 
Gly His Glu Asn Ser 
3055 



cca gga ttt ttc 8514 

Pro Gly Phe Phe 

2835 

gca gac atg aaa 8559 

Ala Asp Met Lys 

2850 

gga gtg aga gaa 8604 

Gly Val Arg Glu 

2865 

tct gat gta ttt 8649 

Ser Asp Val Phe 

2880 

tat tta ggc aag 8694 

Tyr Leu Gly Lys 

2895 

aat gca acc age 8739 

Asn Ala Thr Ser 

2910 

cag gtg cag ctg 8784 

Gin Val Gin Leu 

2925 

cac ctg ctt ctg 8829 

His Leu Leu Leu 

2940 

ttt gta age aag 8874 

Phe Val Ser Lys 

2955 

gta tec tea gga 8919 

Val Ser Ser Gly 

2970 

ata cat gga aaa 8964 

He His Gly Lys 

2985 

cct tec aac tct 9009 

Pro Ser Asn Ser 

3000 

age agt ggc tct 9054 

Ser Ser Gly Ser 

3015 

eta att eta tea 9099 

Leu He Leu Ser 

3030 

aga cca aaa gtt 9144 

Arg Pro Lys Val 

3045 

aat gga caa get 9189 

Asn Gly Gin Ala 

3060 
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tea ttt gaa aac ccc atg tat gat aca aac tta aaa ccc aca gag 9234 
Ser P*}e Glu Asn Pro Met Tyr Asp Thr Asn Leu Lys Pro Thr Glu 
3<f65 3070 " 3075 

.gcc aag get gtg agg ttt gac acg act ctg aac aca gtg tgt aca 9279 
Aia Lys Ala Val Arg Phe Asp Thr Thr Leu Asn Thr Val Cys Thr 
3080 3085 3090 

Stg gta tagccctcag tgccccctag gaccgactca tagccatacc tctgatggac 9335 
Val Val 
3095 



aagcagtaaa 


atcctttggt 


gccatatacc 


acccccttct 


actcttacct 


tgetgeagea 


9395 


acgttggcca 


tegtctgetg 


geataacgea 


gtgggaatgt 


cttctccatc 


atgcccgagt 


9455 


cttctgagga 


teaaattgea 


aatacacctt 


catctggaaa 


gtggcttata 


aaaagecegg 


9515 


ttgetgeate 


caccagaaat 


caagaccccg 


acaacagega 


gggcaaggaa 


gaetgeagag 


9575 


tctcccagac 


cggtggtact 


taatgectet 


gacttttgtg 


tctctgtgtg 


gecaggatge 


9635 


ctttggtgta 


gtcttctgag 


cacaccgata 


catccctcag 


gtgeggegae 


aacatggtag 


9695 


ccacttgatg 


tgtgttttgt 


gtttttctgt 


tttcttttca 


accctatcca 


ctggacatga 


9755 


attctttaca 


aaagaaaagc 


cttcctggag 


aagaegcett 


ctggaaaatg 


cacacacaga 


9815 


egctttgett 


ctgccctgcc 


tgagacagga 


gctctccgga 


tcttcaggct 


ccactgggcg 


9875 


tccatcagcc 


actagggatg 


tttgeagate 


tcacagtcag 


agctggtcca 


tcccagagtt 


9935 


ttttgatget 


caacattttg 


cactagtgtg 


tcaagaatga 


ctaagtctga 


tttctaaaca 


9995 


aactatttcc 


acagggtttg 


tatccactat 


acattgtaca 


tacgeatttt 


ctcataccgt 


10055 


attctcaagc 


aaatgatgee 


actgtcagtt 


aagtttggga 


tgcaaaggaa 


ggtctccccg 


10115 


gatacagaac 


agattttgaa 


aaggagatag 


tgctagtaat 


gctgaagaag 


ttactctttt 


10175 


aattgettet 


gttggccaca 


ttttcatgtc 


aaattcattg 


cctacttcca 


gtggtggaaa 


10235 


tgaagcccgt 


gtattcccct 


tggtatcccc 


ccacttcatg 


tgeatacgae 


tattgtctac 


10295 


accatactaa 


tcaataacag 


ggggctccag 


caatgtctgt 


tttccatgta 


cagatgtgaa 


10355 


tagtaattta 


ttttaggtag 


ctcatgaact 


cagttcacag 


tgaagtcttc 


ccttccggat 


10415 


tgtttccttt 


ctctgttttg 


taacatcacc 


ctcccagaat 


gcattgagag 


tctatctcac 


10475 


agccacaccc 


aagctcagag gaatcgaaag 


ggaaatcaaa 


gaagtccaaa 


teagaategg 


10535 


aagggcaggc 


accgctcgca caccctcatg 


atgatctgtt 


ttatagatta 


tttgecttte 


10595 


tgcaaaaaaa 


aaatcattac 


agtgattttt 


gaaacattaa 


aattccttac 


tgatagacta 


10655 


tctattgtga 


tatatataag ataggtggta 


tggccaacag 


ggataaaata 


aacagectaa 


10715 


agacaaggca 


gggctagaga aatgtctgta 


agaaatttca 


aagagaagat 


catgtttatt 


10775 


tttatttata 


tttgtttgat 


aaaagtattt 


ttggaaatat 


aatgettatt 


ttattatttg 


10835 


acgtttcatg 


cacagtccac 


gtggtaaaaa 


tccccccttt 


gtacatccca 


gatttgeact 


10895 
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gatacatggg 


tcaggatgtc 


atgctgatgt 


tctgtttgct 


gtggtgacta 


cattcatgcc 


10955 


tagctttaag 


acaggtggat 


ctgtctatct 


acatgatgtt 


taaatgcagg 


acttcccaga 


11015 


ggacagtggg 


taacggaaca 


tggcttgctt 


gcggctttgg 


aagttcagca 


ttctgagcgt 


11075 


tccagaggcc 


cggctgggct 


ccctccttct 


agcccactgt 


tcttgcaagg 


gctgtctgtt 


11135 


gtgtgccagg 


gctcctgact 


tcttctgctg 


acactctgtc 


cactgggttc 


catattccag 


11195 


gactccatgt 


cctaggaaag 


agttttgaca 


taggttcctc 


cagccaagcc 


gacacacatc 


11255 


cacggggttc 


ctctgggctc 


cacagaggtt 


cttcattggc 


tccctgggat 


aaattcagat 


11315 


gatgtcagca 


agagtgtgct 


tctataccac 


acattgagcc 


aaaacaaaac 


agagaacgtc 


11375 


agaaggtcca 


cggaccagag 


tgcgcaaggg 


agaacagggt 


tactatatat 


attagatgta 


11435 


tataaaaaca 


cacacacaaa' 


catatatata 


ttgtacatat 


ctaagtttga 


gtcactcaga 


11495 


ctaggtgcaa 


aatgctgact 


ttggagtcta 


aactaacgtc 


tctgtcccca 


catccctggc 


11555 


ctctttcctg 


gccagttaca 


ttaagaagac 


ttgacttaga 


cagggcatac 


atacatgcaa 


11615 


ggaaccacat 


catcagacca 


gtgtcgtttt 


cctttgtgtg 


caaactgacc 


tacagctacc 


11675 


agactgcatc 


atggtattta 


aaaccaacat 


acaatattga 


gcggcactct 


cagttgagag 


11735 


cctagctcaa 


tccttcctag 


gannnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


11795 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnncaccagg 


tctcagaggc 


attgaagacc 


11855 


tagcaggaca 


gtcaggaaca 


ccttcctcag 


tgaggtctag 


acttttccct 


gaagcgccca 


11915 


gagcacagtg 


aggagtcacg 


ctctatgaat 


gacaggttat 


gtgctttgaa 


gctgttcaac 


11975 


tgttgcttgt 


ctttgcccat 


cttgccttca 


ggctagctgc 


aataattttt 


ttcttctgta 


12035 


aaatattttg 


taaacaataa 


caacaacaac 


aaaagctatt 


ataaaaaggg 


agaaaagaaa 


12095 


gctggcatta 


tgatcaggaa 


aaccatccat 


tcttgctgcc 


ccccccctcc 


tgtctccacc 


12155 


acacgctgct 


gtcacaacgt 


aggtgcggaa 


gacctttttg 


tacagagata 


tattttttat 


12215 


gaagaatttg 


taaaattatt 


aaatatgctg 


taattttttg 


attaatgtag 


gtaaattgtt 


12275 


aaaaaataaa 


tgtttttaca 


atatgaaact 


gtaattttcc 


cccataatgt 


aacattaccc 


12335 


tctctaacta 


attttcagtt 


ccaatcctat 


tcgaacatgt 


attaatatta 


aggcggcctg 


12395 


ttaaaatgaa 


cagtatcttt 


ttttttgtca 


aaaaaaatta 


taaagagagt 


gtaacataac 


12455 


ctgtgtaatg 


ccacctatct 


ttaaagcaaa 


tcagagttct 


aattaaatat 


ttaattttag 


12515 


atttcaaaaa 












12525 



<210> 4 
<211> 3095 
<212> PRT 
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<213> Rattus rattus 
<400> 4 

Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val Leu Tyr Val Leu Thr 
1 5 10 15 



Gly Ser Ser Val Pro Asp Leu lie Val Ser Met Ser Asn Gin Met Trp 
20 25 30 



Leu His Leu Gin Ser Asp Asp Ser lie Gly Ser Pro Gly Phe Lys Ala 
35 40 45 



Val Tyr Gin Glu He Glu Lys Gly Gly Cys Gly Asp Pro Gly He Pro 
50 55 60 



Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu His Gly Asp Thr Leu 
65 70 75 80 



Thr Phe Glu Cys Gin Ala Ala Phe Glu Leu Val Gly Glu Arg Val He 
85 90 95 



Thr Cys Gin Arg Asn Asn Gin Trp Ser Gly Asn Lys Pro Ser Cys Val 
100 " 105 110 



Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser Gly lie He Leu Ser 
115 120 125 



Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met Asn Cys Val Trp Leu 
130 135 140 



He He Ser Glu Pro Gly Ser Arg He His Leu He Phe Asn Asp Phe 
145 150 155 160 



Asp Val Glu Pro Gin Phe Asp Phe Leu Ala Val Lys Asp Asp Gly He 
165 170 * ~ 175 



Ser Asp He Thr Val Leu Gly Thr Phe Ser Gly Asn Glu Val Pro Ala 
180 185 190 



Gin Leu Ala Xaa Ser Gly His He Val Arg Leu Glu Phe Gin Ser Asp 
195 200 205 



His Ser Thr Thr Gly Arg Gly Phe Asn He He Tyr Thr Thr Phe Gly 
210 215 220 



WO 02/10199 



PCT/US01/23232 



-44- 

Gln Asn Glu Cys His Asp Pro Gly He Pro Val Asn Gly Arg Arg Phe 
2^5 ; 230 235 240 



Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser Phe His Cys Asp Asp 
245 250 255 

Gly Phe Val Lys Thr Gin Gly Ser Glu Ser He Thr Cys He Leu Gin 
260 " 265 270 

Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro Arg Cys Glu Ala Pro 
275 280 285 

Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val He Leu Pro Pro Gly 
290 295 300 

Trp Pro Gly Tyr Tyr Lys Asp Ser Leu Asn Cys Glu Trp Val He Glu 
305 " 310 315 320 

Ala Lys Pro Gly His Ser He Lys He Thr Phe Asp Arg Phe Gin Thr 
325 330 335 

Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp Gly Pro Thr Ser Ser 
340 345 350 

Ser Pro Leu He Gly Glu Tyr His Gly Thr Gin Ala Pro Gin Phe Leu 
355 360 365 



He Ser Thr Gly Asn Tyr Met Tyr Leu Leu Phe Thr Thr Asp Ser Ser 
370 375 380 

Arg Ala Ser Val Gly Phe Leu He His Tyr Glu Ser Val Thr Leu Glu 
385 390 395 400 

Ser Asp Ser Cys Leu Asp Pro Gly He Pro Val Asn Gly His Arg His 
405 410 415 



Gly Ser Asn Phe Gly He Arg Ser Thr Val Thr Phe Ser Cys Asp Pro 
420 425 430 



Gly Tyr Thr Leu Ser Asp Asp Asp Pro Leu He Cys Glu Lys Asn His 
435 440 445 



Gin Trp Asn His Ala Leu Pro Ser Cys Asp Ala-Leu Cys Gly Gly Tyr 
450 455 460 



He His Gly Lys Ser Gly Thr Val Leu Ser Pro Gly Phe Pro Asp Phe 
465 470 475 480 
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Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr lie Glu Val Ser His Gly 
485 " 490 495 



Lys Gly Val Gin Met Asn Phe His Thr Phe His Leu Glu Ser Ser His 
500 505 510 



Asp Tyr Leu Leu lie Thr Glu Asp Gly Ser Phe Ser Glu Pro Val Ala 
515 520 525 



Arg Leu Thr Gly Ser Val Leu Pro His Thr He Lys Ala Gly Leu Phe 
530 " 535 540 



Gly Asn Phe Thr Ala Gin Leu Arg Phe He Ser Asp Phe Ser He Ser 
545 550 ~ 555 560 



Tyr Glu Gly Phe Asn He Thr Phe Ala Glu Tyr Asp Leu Glu Pro Cys 
565 570 575 



Asp Asp Pro Gly Val Pro Ala Tyr Ser Arg Arg He Gly Phe Gin Phe 
580 585 590 



Gly Val Gly Asp Thr Leu Ala Phe Thr Cys Phe Gin Gly Tyr Arg Leu 
595 600 605 



Glu Gly Ala Thr Lys Leu Thr Cys Leu Gly Gly Gly Arg Arg Val Trp 
610 615 620 



Ser Ala Pro Leu Pro Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys 
625 630 635 640 



Gly Asn Glu Gly Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr Asp 
645 650 655 



Asn Asn His Glu Cys He Tyr Lys He Glu Thr Glu Ala Gly Lys Gly 
660 665 670 



He His Leu Arg Ala Arg Thr Phe Gin Leu Phe Glu Gly Asp Thr Leu 
675 680 685 



Lys Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Ser Leu Gly Val Phe 
690 695 700 



Thr Arg Ser Glu Leu Met Gly Leu Val Leu Asn Ser Thr Ser Asn His 
705 710 715 720 



Leu Arg Leu Glu Phe Asn Ser Asn Gly Ser Asp Thr Ala Gin Gly Phe 
725 ~ 730 735 
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Gln Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys Cys Glu Asp Pro Gly 
740 745 750 



lie Pro Asn Tyr Gly Tyr Arg lie Arg Asp Asp Gly His Phe Thr Asp 
755 760 765 



Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr Ala Met His Gly Ser 
770 775 780 



Ser Thr Leu Thr Cys Leu Ser Gly Asp Arg Arg Val Trp Asp Lys Pro 
785 790 795 800 

Met Pro Ser Cys Val Ala Glu Cys Gly Gly Leu Val His Ala Ala Thr 
805 810 815 



Ser. Gly Arg lie Leu Ser Pro Gly Tyr Pro Ala Pro Tyr Asp Asn Asn 
820 825 830 



Leu His Cys Thr Trp Thr lie Glu Ala Asp Pro Gly Lys Thr Xaa Ser 
835 840 845 

Leu His Phe He Val Phe Asp Thr Glu Thr Ala His Asp He Leu Lys 
850 855 860 



Val Trp Asp Gly Pro Val Asp Ser Asn He Leu Leu Lys Glu Trp Ser 
865 870 875 880 

Gly Ser Ala Leu Pro Glu Asp He His Ser Thr Phe Asn Ser Leu Thr 

885 890 895 



Leu Gin Phe Asp Ser Asp Phe Phe He Ser Lys Ser Gly Phe Ser He 
900 905 910 



Gin Phe Ser Thr Ser He Ala Ser Thr Cys Asn Asp Pro Gly Met Pro 
915 920 925 

Gin Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu Pro Gly Asp Thr He 
930 " 935 940 



Thr Phe Gin Cys Asp Pro Gly Tyr Gin Leu Gin Gly Gin Ala Lys lie 
945 950 955 960 



Thr Cys Val Gin Leu Asn Asn Arg Phe Phe Trp Gin Pro Asp Pro Pro 
965 970 975 



Ser Cys He Ala Ala Cys Gly Gly Asn Leu Thr Gly Pro Ala Gly Val 
980 985 990 
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Ile Leu Ser Pro Asn Tyr Pro Gin Pro Tyr Pro Pro Gly Lys Glu Cys 
995 1000 1005 

F 

Asp Trp Arg He Lys Val Asn Pro Asp Phe Val He Ala Leu He 
1010 * 1015 1020 

Phe Lys Ser Phe Ser Met Glu Pro Ser Tyr Asp Phe Leu His He 
^' 1025 1030 1035 



ijfi ox ca v?x_y xjy o noy oox now ooj- riw ucu lie oxy oci rue oxii 

1040 1045 1050 



Gly Ser Gin Ala Pro Glu Arg He Glu Ser Ser Gly Asn Ser Leu 
1055 1060 1065 



Phe Leu Ala Phe Arg Ser Asp Ala Ser Val Gly Leu Ser Gly Phe 
1070 1075 1080 



Ala He Glu Phe Lys Glu Lys Pro Arg Glu Ala Cys Phe Asp Pro 
1085 1090 1095 



Gly Asn He Met Asn Gly Thr Arg He Gly Thr Asp Phe Lys Leu 
1100 1105 1110 



Gly Ser Thr Val Thr Tyr Gin Cys Asp Ser Gly Tyr Lys He Val 
1115 1120 1125 



Asp Pro Ser Ser He Glu Cys Val Thr Gly Ala Asp Gly Lys Pro 
1130 1135 1140 



Ser Trp Asp Arg Ala Leu Pro Ala Cys Gin Ala Pro Cys Gly Gly 
1145 1150 1155 



Gin Tyr Met Gly Ser Glu Gly Val Val Leu Ser Pro Asn Tyr Pro 
1160 1165 1170 



His Asn Tyr Thr Ala Gly Gin He Cys He Tyr Ser He Thr Val 
1175 1180 1185 



Pro Lys Glu Phe Val Val Phe Gly Gin Phe Ala Tyr Phe Gin Thr 
1190 1195 1200 



Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp Gly Thr His Pro Gin 
1205 1210 1215 



Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser His Ser Gly Glu Thr 
1220 1225 1230 
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Leu Pro Leu Ala Thr Ser Asn Gin He Leu Leu Arg Phe Ser Ala 
1235 1240 1245 

Lys Ser Gly Ala Ser Ala Arg Gly Phe His Phe Val Tyr Gin Ala 
1250 1255 1260 



Val Pro Arg Thr Ser Asp Thr Gin Cys Ser Ser Val Pro Glu Pro 
1265 1270 1275 



Arg Tyr Gly Arg Arg He Gly Ser Glu Phe Ser Ala Gly Ser He 
1280 ' 1285 1290 



Val Arg Phe Glu Cys Asn Pro Gly Tyr Leu Leu Gin Gly Ser Thr 
1295 1300 1305 



ALa He Arg Cys Gin Ser Val Pro Asn Ala Leu Ala Gin Trp Asn 

1310 "~ 1315 1320 

l Asp Thr He Pro Ser Cys Val Val Pro Cys Ser Gly Asn Phe Thr 

1325 1330 1335 

Gin Arg Arg Gly Thr lie Leu Ser Pro Gly Tyr Pro Glu Pro Tyr 

1340 " 1345 1350 

Gly Asn Asn Leu Asn Cys Val Trp Lys He He Val Ser Glu Gly 

1355 1360 1365 

Ser Gly He Gin He Gin Val He Ser Phe Ala Thr Glu Gin Asn 

1370 1375 1380 

Trp Asp Ser Leu Glu He His Asp Gly Gly Asp Met Thr Ala Pro 

1385 1390 1395 

Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro Ala Leu Leu Asn 

1400 1405 1410 



Ser Thr Ser Asn Gin teu Cys Leu His Phe Gin Ser Asp He Ser 
1415 1420 1425 



Val Ala Ala Ala Gly Phe His Leu Glu Tyr Lys Thr Val Gly Leu 
1430 1435 1440 



Ala Ala Cys Gin Glu Pro Ala Leu Pro Ser Asn Gly He Lys He 
1445 1450 1455 



Gly Asp Arg Tyr Met Val Asn Asp Val Leu Ser Phe Gin Cys Glu 
1460 " 1465 * 1470 
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Pro Gly Tyr Thr Leu Gin Gly Arg Ser His lie Ser Cys Met Pro 
1475 1480 1485 

Gly Thr val Arg Arg Trp Asn Tyr Pro Ser Pro Leu Cys lie Ala 
1490 1495 1500 

Thr Cys Gly Gly Thr Leu Thr Ser Met Ser Gly Val He Leu Ser 
1505 " 1510 1515 

Pro Gly Phe Pro Gly Ser Tyr Pro Asn Asn Leu Asp Cys Thr Trp 
1520 1525 1530 

Lys He Ser Leu Pro He Gly Tyr Gly Ala His He Gin Phe Leu 
1535 1540 1545 

Asn Phe Ser Thr Glu Ala Asn His Asp Tyr Leu Glu He Gin Asn 
1550 1555 1560 

Gly Pro Tyr His Ser Ser Pro Met Met Gly Gin Phe Ser Gly Pro 
1565 1570 1575 

Asp Leu Pro Ala Ser Leu Leu Ser Thr Thr His Glu Thr Leu He 
1580 1585 1590 

Arg Phe Tyr Ser Asp His Ser Gin Asn Arg Gin Gly Phe Lys Leu 
1595 1600 1605 

Ser Tyr Gin 'Ala Tyr Glu Leu Gin Asn Cys Pro Asp Pro Pro Ala 
1610 1615 1620 

Phe Gin Asn Gly Phe Met He Asn Ser Asp Tyr Ser Val Gly Gin 
1625 1630 1635 

Ser He Ser Phe Glu Cys Tyr Pro Gly Tyr He Leu Leu Gly His 
1640 1645 1650 



Pro Val Leu Thr Cys Gin His Gly Thr Asp Arg Asn Trp Asn Tyr 
1655 1660 1665 



Pro Phe Pro Arg Cys Asp Ala Pro Cys Gly Tyr Asn Val Thr Ser 
1670 ~ 1675 1680 



Gin Asn Gly Thr He Tyr Ser Pro Gly Phe Pro Asp Glu Tyr Pro 
1685 1690 1695 



He Leu Lys Asp Cys Leu Trp Leu Val Thr Val Pro Pro Gly His 
1700 1705 1710 
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Gly Val Tyr He Asn Phe Thr Leu Leu Gin Thr Glu Ala Val Asn 
1715 1720 1725 

Asp Tyr He Ala Val Trp Asp Gly Pro Asp Gin Asn Ser Pro Gin 
1730 1735 1740 



Leu Gly Val Phe Ser Gly Asn Thr Ala Leu Glu Thr Ala Tyr Ser 
1745 1750 1755 



Ser Thr Asn Gin Val Leu Leu Lys Phe His Ser Asp Phe Ser Asn 
1760 1765 1770 



Gly Gly Phe Phe Val Leu Asn Phe His Ala Phe Gin Leu Lys Arg 
1775 1780 1785 

Cys Pro Pro Pro Pro Val Val Pro Gin Ala Asp Leu Leu Thr Glu 
1790 1795 1800 

Asp Glu Asp Phe Glu He Gly Asp Phe Val Lys Tyr Gin Cys His 
1805 1810 1815 



Pro Gly Tyr Thr Leu Leu Gly Ser Asp Thr Leu Thr Cys Lys Leu 
1820 1825 1830 

Ser Ser Gin Leu Leu Phe Gin Gly Ser Pro Pro Thr Cys Glu Ala 
1835 1840 1845 

Gin Cys Pro Ala Asn Glu Val Arg Thr Glu Ser Ser Gly Val He 
1850 1855 I860 



Leu Ser Pro Gly Tyr Pro Gly Asn Tyr Phe Asn Ser Gin Thr Cys 
1865 1870 1875 

Ala Trp Ser He Lys Val Glu Pro Asn Phe Asn He Thr Leu Phe 
1880 1885 1890 



Val Asp Thr Phe Gin Ser Glu Lys Gin Phe Asp Ala Leu Glu Val 
1895 1900 1905 



Phe Asp Gly Ser Ser Gly Gin Ser Pro Leu Leu Val Val Leu Ser 
1910 1915 1920 



Gly Asn His Thr Glu Gin Ser Asn Phe Thr Ser Arg Ser Asn His 
1925 1930 1935 



Leu Tyr Leu Arg Trp Ser Thr Asp His Ala Thr Ser Lys Lys Gly 
1940 1945 1950 
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Phe Lys lie Arg Tyr Ala Ala Pro Tyr Cys Ser Leu Thr Ser Thr 
19£5 1960 1965 

Leu Lys Asn Gly Gly Val Leu Asn Lys Thr Ala Gly Ala Leu Gly 
1970 " 1975 1980 



Ser Lys Val Gin Tyr Phe Cys Lys Pro Gly Tyr Arg Met lie Gly 
— 1985 1990 * 1995 



His Ser Asn Ala Thr Cys Arg Arg Asn Pro Val Gly Val Tyr Gin 
2000 2005 2010 



Trp Asp Ser Met Ala Pro Leu Cys Gin Ala Val Ser Cys Gly lie 
2015 2020 2025 



Pro Glu Ala Pro Gly Asn Gly Ser Phe Thr Gly Asn Glu Phe Thr 
2030 2035 2040 



Leu Asp Ser Lys Val Thr Tyr Glu Cys Asn Glu Gly Phe Lys Leu 
2045 ~ 2050 2055 



Asp Ala Ser Gin Gin Ala Thr Ala Val Cys Gin Glu Asp Gly Leu 
2060 2065 2070 



Trp Ser Asn Arg Gly Lys Pro Pro Thr Cys Lys Pro Val Pro Cys 
2075 2080 2085 



Pro Ser lie Glu Gly Gin Leu Ser Glu His Val Leu Trp Arg Leu 
2090 2095 2100 



Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala Gin Val Leu Leu Ser 
2105 2110 2115 



Cys Ser Pro Gly Tyr Phe Leu Gin Gly Gin Arg Leu Leu Gin Cys 
2120 2125 2130 



Gin Ala Asn Gly Thr Trp Asn Thr Glu Glu Asp Arg Pro Arg Cys 
2135 2140 2145 



Lys Val lie Ser Cys Gly Ser Leu Ser Phe Pro Pro Asn Gly Asn 
2150 2155 2160 



Lys lie Gly Thr Leu Thr Met Tyr Gly Ala Thr Ala lie Phe Thr 
2165 2170 2175 



Cys Asn Thr Gly Tyr Thr Leu Val Gly- Ser His Val Arg Glu Cys 
2180 2185 2190 
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Leu Ala Asn Gly Leu Trp Ser Gly Ser Glu Thr Arg Cys Leu Ala 
2195 2200 2205 



Gly His Cys Gly Ser Pro Asp Pro lie Val Asn Gly His lie Ser 
2210 " 2215 2220 



dly Asp Gly Phe Ser Tyr Arg Asp Thr Val Val Tyr Gin Cys Asn 
2225 " ' 2230 2235 



Pro Gly Phe Arg Leu Val Gly Thr Ser Val Arg lie Cys Leu Gin 
2240 " 2245 2250 



Asp His Lys Trp Ser Gly Gin Thr Pro Val Cys Val Pro lie Thr 
2255 ~ 2260 2265 



Cys Gly His Pro Gly Asn Pro Ala His Gly Leu Thr Asn Gly Ser 
2270 2275 2280 



Glu Phe Asn Leu Asn Asp Leu Val Asn Phe Thr Cys His Thr Gly 
2285 2290 2295 



Tyr Leu Leu Gin Gly Ala Ser Arg Ala Gin Cys Arg Ser Asn Gly 
2300 2305 2310 



Gin Trp Ser Ser Pro Leu Pro lie Cys Arg Val Val Asn Cys Ser 
2315 2320 2325 



Asp Pro Gly Phe Val Glu Asn Ala Val Arg His Gly Gin Gin Asn 
2330 2335 2340 



Phe Pro Glu Ser Phe Glu Tyr Gly Thr Ser Val Met Tyr His Cys 
2345 2350 2355 



Lys Lys Gly Phe Tyr Leu Leu Gly Ser Ser Ala Leu Thr Cys Met 
2360 2365 2370 



Ala Ser Gly Leu Trp Asp Arg Ser Leu Pro Lys Cys Leu Ala He 
2375 ~ " 2380 2385 



Ser Cys Gly His Pro Gly Val Pro Ala Asn Ala Val Leu Thr Gly 
2390 2395 2400 



Glu Leu Phe Thr Phe Gly Ala Thr Val Gin Tyr Ser Cys Lys Gly 
2405 2410 2415 



Gly Gin He Leu Thr Gly Asn Ser Thr Arg Val Cys Gin Glu Asp 
2420 2425 2430 
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Ser His Trp Ser Gly Ser Leu Pro His Cys Ser Gly Asn Ser Pro 
24^5 2440 2445 

Gly Phe Cys Gly Asp Pro Gly Thr Pro Ala His Gly Ser Arg Leu 
2450 " ~ 2455 2460 



Gly Asp Glu Phe Lys Thr Lys Ser Leu Leu Arg Phe Ser Cys Glu 
— 2465 ^ 2470 2475 



Met Gly His Gin Leu Arg Gly Ser Ala Glu Arg Thr Cys Leu Val 
2480 2485 2490 



Asn Gly Ser Trp Ser Gly Val Gin Pro Val Cys Glu Ala Val Ser 
2495 2500 2505 



Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly Met lie Leu Ser Ser 
2510 2515 2520 



Asp Gly lie Leu Phe Ser Ser Ser Val lie Tyr Ala Cys Trp Glu 
2525 2530 2535 



Gly Tyr Lys Thr Ser Gly Leu Met Thr Arg His Cys Thr Ala Asn 
2540 * 2545 2550 



Gly Thr Trp Thr Gly Thr Ala Pro Asp Cys Thr lie lie Ser Cys 
2555 * 2560 2565 



Gly Asp Pro Gly Thr Leu Pro Asn Gly lie Gin Phe Gly Thr Asp 
2570 2575 2580 



Phe Thr Phe Asn Lys Thr Val Ser Tyr Gin Cys Asn Pro Gly Tyr 
2585 2590 2595 



Leu Met Glu Pro Pro Thr Ser Pro Thr lie Arg Cys Thr Lys Asp 
2600 2605 2610 



Gly Thr Trp Asn Gin Thr Arg Pro Leu Cys Lys Ala Val Leu Cys 
2615 2620 2625 



Ser Gin Pro Pro Ser Val Pro Asn Gly Lys Val Glu Gly Ser Asp 
2630 2635 2640 



Phe Arg Trp Gly Ala Ser lie Ser Tyr Ser Cys Val Asp Gly Tyr 
2645 2650 2655 



Gin Leu Ser His Ser Ala lie Leu Ser Cys Glu Gly Arg Gly Val 
2660 2665 2670 
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Trp Lys Gly Glu Val Pro Gin Cys Leu Pro Val Phe Cys Gly Asp 
2675 " 2680 2685 

Pro Gly Thr Pro Ala Glu Gly Arg Leu Ser Gly Lys Ser Phe Thr 
2690 2695 2700 

Phe Lys Ser Glu Val Phe lie Gin Cys Lys Pro Pro Phe Val Leu 
~ 2705 2710 2715 

Val Gly Ser Ser Arg Arg Thr Cys Gin Ala Asp Gly Met Trp Ser 
2720 2725 2730 

». 

Gly lie Gin Pro Thr Cys lie Asp Pro Ala His Thr Ala Cys Pro 
2735 2740 2745 

Asp Pro Gly Thr Pro His Phe Gly lie Gin Asn Ser Ser Lys Gly 
2750 2755 2760 

Tyr Glu Val Gly Ser Thr Val Phe Phe Arg Cys* Arg Lys Gly Tyr 
2765 2770 2775 

His lie Gin Gly Ser Thr Thr Arg Thr Cys Leu Ala Asn Leu Thr 
2780 2785 2790 

Trp Ser Gly He Gin Thr Glu Cys He Pro His Ala Cys Arg Gin 
2795 ' 2800 2805 

Pro Glu Thr Pro Ala His Ala Asp Val Arg Ala He Asp Leu Pro 
2810 2815 2820 

Ala Phe Gly Tyr Thr Leu Val Tyr Thr Cys His Pro Gly Phe Phe 
2825 " 2830 2835 

Leu Ala Gly Gly Ser Glu His Arg Thr Cys Lys Ala Asp Met Lys 
2840 " 2845 2850 

Trp Thr Gly Lys Ser Pro Val Cys Lys Ser Lys Gly Val Arg Glu 
2855 2860 2865 

Val Asn Glu Thr Val Thr Lys Thr Pro Val Pro Ser Asp Val Phe 
2870 2875 . 2880 

Phe He Asn Ser Val Trp Lys Gly Tyr Tyr Glu Tyr Leu Gly Lys 
2885 2890 2895 



Arg Gin Pro Ala Thr Leu Thr Val Asp Trp Phe Asn Ala Thr Ser 
2900 2905 2910 
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Ser Lys Val Asn Ala Thr Phe Thr Ala Ala Ser Gin Val Gin Leu 
2915 2920 2925 



Glu Leu Thr Gly Val Tyr Lys Lys Glu Glu Ala His Leu Leu Leu 
2930 2935 2940 



Lys Ala Phe His lie Lys Gly Pro Ala Asp He Phe Val Ser Lys 
2945 2950 2955 



Fhe Glu Asn Asp Asn Trp uiy Leu Asp uiy Tyr Val Ser ser Gly 
2960 2965 2970 



Leu Glu Arg Gly Gly Phe Ser Phe Gin Gly Asp He His Gly Lys 
2975 2980 2985 



Asp Phe Gly Lys Phe Lys Leu Glu Arg Gin Asp Pro Ser Asn Ser 
2990 2995 3000 



Asp Ala Asp Ser Ser Asn His Tyr Gin Gly Thr Ser Ser Gly Ser 
3005 3010 * 3015 



Val Ala Ala Ala He Leu Val Pro Phe Phe Ala Leu He Leu Ser 
3020 3025 3030 



Gly Phe Ala Phe Tyr Leu Tyr Lys His Arg Thr Arg Pro Lys Val 
3035 3040 3045 



Gin Tyr Asn Gly Tyr Ala Gly His Glu Asn Ser Asn Gly Gin Ala 
3050 3055 3060 



Ser Phe Glu Asn Pro Met Tyr Asp Thr Asn Leu Lys Pro Thr Glu 
3065 3070 3075 

Ala Lys Ala Val Arg Phe Asp Thr Thr Leu Asn Thr Val Cys Thr 
3080 3085 3090 



Val Val 
3095 



<210> 5 

<211> 2527 

<212> PRT 

<213> Homo sapiens 



<220> 
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<221> 



misc 



<222> 



(684) . . (684) 



<223> 



X = amino acid 



<220> 



<221> misc 

<222> (1134) . - (1134) 

<223> X = amino acid 



<400> 5 

Lys Ser Cys Arg Asn Pro Pro Asp Pro Val Asn Gly Met Val His Val 
15 10 .15 

lie Lys Gly He Gin Phe Gly Ser Gin He Lys Tyr Ser Cys Thr Lys 
20 25 30 

Gly Tyr Arg Leu He Gly Ser Ser Ser Ala Thr Cys He He Ser Gly 
35 40 45 

Asp Thr Gin Asn Cys Pro Asp Pro Pro Pro Phe Gin Asn Gly Tyr Met 
50 55 60 

He Asn Ser Asp Tyr Ser Val Gly Gin Ser Val Ser Phe Glu Cys Tyr 
65 70 75 80 

Pro Gly Tyr He Leu He Gly His Pro Val Leu Thr Cys Gin His Gly 
85 90 95 

He Asn Val He Trp Asp Asn Glu Thr Pro He Cys Asp Arg He Pro 
100 105 110 

Cys Gly Leu Pro Pro Thr He Thr Asn Gly Asp Phe He Ser Thr Asn 
115 120 125 

Arg Glu Asn Phe His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro 
130 135 140 

Gly Arg Asn Trp Asn Tyr Pro Phe Pro Arg Cys Asp Ala Pro Cys Gly 
145 ^ 150 155 160 

Tyr Asn Val Thr Ser Gin Asn Gly Thr He Tyr Ser Pro Gly Phe Pro 
165 170 175 

Asp Glu Tyr Pro He Leu Lys Asp Cys lie Trp Leu He Thr Val Pro 
180 185 190 

Pro Gly Ser Gly Gly Arg Lys Val Phe Glu Leu Val Gly Glu Pro Ser 
195 200 205 

He Tyr Cys Thr Ser Asn Asp Asp Gin Val Gly He Trp Ser Gly Pro 
210 215 220 
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Ala Pro Gin Cys lie He Pro Asn Lys Cys Thr Pro Pro Asn Val Glu 
2d£ , 230 235 240 

Asn His Gly Val Tyr He Asn Phe Thr Leu Leu Gin Thr Glu Ala Val 
245 250 255 

Asn Asp Tyr He Ala Val Trp Asp Gly Pro Asp Gin Asn Ser Pro Gin 
260 265 270 

Leu Gly Val Phe Ser Gly Asn Thr Ala Leu Glu Thr Gly He Leu Val 
275 280 285 

Ser Asp Asn Axg Ser Leu Phe Sex* Leu Asn Glu Val Val Glu Phe Arg 
290 ^ 295 300 

Cys Gin Pro Gly Phe Val Met Lys Gly Pro Arg Arg Val Lys Cys Gin 
305 ' 310 315 320 

Ala Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg Ala Tyr 
325 330 335 

Ser Ser Thr Asn Gin Val Leu Leu Lys Phe His Ser Asp Phe Ser Asn 
340 345 350 

Gly Gly Phe Phe Val Leu Asn Phe His Ala Phe Gin Leu Lys Val Cys 
355 360 365 

Gin Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gin Arg Asp Lys 
370 375 380 

Asp Asn Phe Ser Pro Gly Gin Glu Val Phe Tyr Ser Cys Glu Pro Gly 
385 390 395 400 

Tyr Asp Leu Arg Gly Ala Ala Ser Met Arg Cys Thr Pro Gin Lys Cys 
405 410 415 

Gin Pro Pro Pro Ala Val Pro Gin Ala Glu Met Leu Thr Glu Asp Asp 
420 425 430 

Asp Phe Glu He Gly Asp Phe Val Lys Tyr Gin Cys His Pro Gly Tyr 
435 440 445 

Thr Leu Val Gly Thr Asp He Leu Thr Cys Lys Leu Ser Ser Gin Gly 
450 455 460 

Asp Trp Ser Pro Ala Ala Pro Thr Cys Glu Val Lys Ser Cys Asp Asp 
465 470 475 480 

Phe Met Gly Gin Leu Leu Asn Gly Arg Leu Gin Phe Glu Gly Ser Leu 
485 490 495 

Pro Thr Cys Glu Ala Gin Cys Pro Ala Asn Glu Val Arg Thr Gly Ser 
500 505 510 

Ser Gly Val He Leu Ser. Pro Gly Tyr Pro Gly Asn Tyr Phe Asn Ser 
515 520 525 

Gin Thr Cys Ser Trp Ser lie Lys Val Glu Pro Asn Leu Gin Leu Gly 
530 535 540 

Ala Lys Val Asp Phe Val Cys Asp Glu Gly Phe Gin Leu Lys Gly Ser 
545 550 555 560 
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Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser Asn Tyr Asn lie 
j 565 570 575 

Thr lie Phe Val Asp Thr Phe Gin Ser Glu Lys Gin Phe Asp Ala Leu 
580 585 590 

Glu Val Phe Asp Gly Ser Ser Gly Gin Ser Pro Leu Leu Val Val Leu 
* 595 600 605 

Ser Gly Asn His Thr Glu Gin Ser Asn Phe Thr Ser Arg Ser Leu Trp 
610 615 620 

Asn Ser Ser Val Pro Val Cys Glu Gin lie Phe Cys Pro Ser Pro Pro 
625 630 635 640 

Val lie Pro Asn Gly Arg Hie Thr Gly Lys Pro Leu Glu Val Phe Pro 
645 650 65.5 

Phe Gly Lys Asn Gin Leu Tyr Leu Arg Trp Ser Thr Asp His Ala Thr 
660 665 670 

Ser Lys Lys Gly Phe Lys lie Arg Tyr Ala Ala Pro Tyr Cys Ser Leu 
675 * 680 685 

Thr His Pro Leu Lys Asn Gly Gly lie Leu Asn Arg Thr Ala Gly Ala 
690 695 700 

Val Gly Ser Ala Val Asn Tyr Thr Cys Asp Pro His Pro Asp Arg Gly 
705 710 715 720 

Thr Ser Phe Asp Leu lie Gly Glu Ser Thr lie Arg Cys Thr Ser Asp 
725 730 735 

Pro Gin Gly Asn Gly Val Trp Ser Ser Pro Ala Pro Arg Cys Gly lie 
740 745 750 

Leu Gly His Cys Gin Lys Val His Tyr Phe Cys Lys Pro Gly Tyr Arg 
755 760 765 

Met Val Gly His Ser Asn Ala Thr Cys Arg Arg Asn Pro Leu Gly Met 
770 775 780 

Tyr Gin Trp Asp Ser Leu Thr Pro Leu Cys Gin Ala Val Ser Cys Gly 
785 790 795 800 

Ala Pro Asp His Phe Leu Phe Ala Lys Leu Lys Thr Gin Thr Asn Ala 
805 810 815 

Ser Asp Phe Pro lie Gly Thr Ser Leu Lys Tyr Glu Cys Arg Pro Glu 
820 825 830 

Tyr Tyr Gly Arg Pro Phe Ser lie Thr Cys Leu Asp Asn Leu Val lie 
835 ~ 840 845 

Pro Glu Ser Pro Gly Asn Gly Ser Phe Thr Gly Asn Glu Phe Thr Leu 
850 855 860 

Asp Ser Lys Val Val Tyr Glu Cys His Glu Gly Phe Lys Leu Glu Ser 
865 870 875 880 

Ser Gin Gin Ala Thr Ala Val Cys Gin Glu Asp Gly Leu Trp Ser Ser 
885 890 895 
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Pro Lys Asp Val Cys Lys Arg Lys Ser Cys Lys Thr Pro Pro Asp Pro 
900 905 910 

Val Asn Gly Met Val His Val lie Thr Asp lie Gin Val Gly Ser Arg 
915 920 925 

lie Asn Tyr Ser Cys Thr Thr Trp Ser Asn Lys Gly Lys Pro Pro Thr 
930 935 940 

Cys Lys Pro Val Ala Cys Pro Ser lie Glu Ala Gin Leu Ser Glu His 
945 950 955 960 

Val lie Trp Arg Leu Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala Gin 
965 970 975 

Val Leu Leu Ser Cys Ser Pro Gly His Arg Leu lie Gly His Ser Ser 
980 985 990 

Ala Glu Cys lie Leu Ser Gly Asn Ala Ala His Trp Ser Thr. Lys Pro 
995 1000 1005 

Pro lie Cys Gin Arg lie Pro Cys Gly Leu Pro Pro Thr lie Ala 
1010 1015 1020 

Asn Gly Asp Phe lie Ser Thr Asn Gly Tyr Tyr Leu Glu Gly Trp 
1025 1030 1035 

Arg Leu Leu Arg Cys Gin Ala Asn Gly Thr Trp Asn lie Gly Asp 
1040 1045 1050 

Glu Arg Pro Ser Cys Arg Val He Ser Cys Gly Ser Leu Ser Phe 
1055 1060 1065 

Pro Pro Asn Gly Asn Lys He Gly Thr Leu Arg Glu Asn Phe His 
1070 1075 1080 

Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro Gly Ser Gly Gly 
1085 1090 1095 

Arg Lys Val Phe Glu Leu Val Gly Glu Pro Ser He. Tyr Cys Thr 
1100 1105 1110 

Ser Asn Asp Asp Gin Val Gly He Trp Ser Gly Pro Ala Pro Gin 
1115 1120 1125 

Thr Val Tyr Gly Ala Thr Ala He Phe Thr Cys Asn Thr Gly Tyr 
1130 1135 1140 

Thr Leu Val Gly Ser His Val Arg Glu Cys Leu- Ala Asn Gly Leu 
1145 1150 1155 

Trp Ser Gly Ser Glu Thr Arg Cys He Xaa Pro Asn Lys Cys Thr 
1160 1165 1170 

Pro Pro Asn Val Glu Asn Gly He Leu Val Ser Asp Asn Arg Ser 
1175 1180 1185 

Leu Phe Ser Leu Asn Glu Val Val Glu Phe Arg Cys Gin Pro Gly 
1190 1195 1200 

Phe Val Met Lys Gly Pro Arg Arg Val Lys Cys. Gin Cys Leu Ala 
1205 1210 * 1215 
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Gly His Cys Gly Ser Pro Asp Pro He Val Asn Gly His lie Ser 
1220 1225 1230 

Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val Val Tyr Gin Cys Asn 
* 1235 * 1240 1245 

Pro Gly Phe Arg Leu Val Gly Thr Ser Val Arg He Cys Leu Ala 
1250 W 1255 1260 

Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg Val Cys 
1265 1270 1275 

Gin Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gin Arg Asp 
1280 1285 1290 

Lys Asp Asn Phe Ser Pro Gly Gin Glu Val Phe Tyr Ser Cys Glu 
1295 1300 1305 

Pro Gly Tyr Gin Asp His Lys Trp Ser Gly Gin Thr Pro Val Cys 
1310 ~ 1315 1320 

Val Pro He Thr Cys Gly His Pro Gly Asn Pro Ala His Gly Phe 
1325 1330 1335 

Thr Asn Gly Ser. Glu Phe Asn Leu Asn Asp Val Val Asn Phe Thr 
1340 1345 1350 

Cys Asn Thr Gly Tyr Asp Leu Arg Gly Ala Ala Ser Met Arg Cys 
1355 1360 1365 

Thr Pro Gin Gly Asp Trp Ser Pro Ala Ala Pro Thr Cys Glu Val 
1370 1375 1380 

Lys Ser Cys Asp Asp Phe Met Gly Gin Leu Leu Asn Gly Arg Val 
1385 1390 1395 

Leu Phe Pro Val Asn Leu Gin Leu Leu Gin Gly Val Ser Arg Ala 
1400 1405 1410 

Gin Cys Arg Ser Asri Gly Gin Trp Ser Ser Pro Leu Pro Thr Cys 
1415 1420 1425 

Arg Val Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn Ala He 
1430 1435 1440 

Arg His Gly Gin Gin Asn Phe Pro Glu Ser Phe Glu Leu Gly Ala 
1445 1450 1455 

Lys Val Asp Phe Val Cys Asp Glu Gly Phe Gin Leu Lys Gly Ser 
1460 * 1465 1470 

Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser Leu- Trp Asn 
1475 1480 1485 

Ser Ser Val Pro Val Cys Glu Gin He Phe Cys Pro Ser Pro Pro 
1490 1495 1500 

Val He Tyr Gly Met Ser He Leu Tyr His Cys Lys Lys Gly Phe 
1505 1510 ' 1515 

Tyr Leu Leu Gly Ser Ser Ala Leu Thr Cys Met Ala Asn Gly Leu 
1520 1525 1530 
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Trp Asp Arg Ser Leu Pro Lys Cys Leu Ala lie Ser Cys Gly His 

1535 1540 1545 

Pro Gly Val Pro Pro Asn Gly Arg His Thr Gly Lys Pro Leu Glu 

1550 1555 ~ 1560 

Val Phe Pro Phe Gly Lys Thr Val Asn Tyr Thr Cys Asp Pro His 

1565 1570 1575 

Pro Asp Arg Gly . Thr Ser Phe Asp Leu lie Gly Glu Ser Thr He 

1580 1585 1590 

Arg Cys Thr Ser Asp Pro Gin Gly Asn Ala Asn Ala Val Leu Thr 

1595 1600 1605 

Gly Glu Leu Phe Thr Tyr Gly Ala Val Val His Tyr Ser Cys Arg 

1610 1615 1620 

Gly Ser Glu Ser Leu He Gly . Asn Asp Thr Arg Val Cys Gin Glu 

1625 1630 1635 

Asp Ser His Gly Val Trp Ser Ser Pro Ala Pro Arg Cys Gly He 

1640 1645 1650 

Leu Gly His Cys Gin Ala Pro Asp His Phe Leu Phe Ala Lys Leu 

1655 1660 1665 

Lys Thr Gin Thr Asn Ala Ser Asp Phe Pro He Gly Thr Ser Leu 

1670 1675 1680 

Lys Tyr Glu Trp Ser Gly Ala Leu Pro His Cys Thr Gly Asn Asn 

1685 1690 1695 

Pro Gly Phe Cys Gly Asp Pro Gly Thr Pro Ala His . Gly Ser Arg 

1700 1705 1710 

Leu Gly Asp Asp Phe Lys Thr Lys Ser Leu Leu Arg Phe Ser Cys 

1715 1720 1725 

Arg Pro Glu Tyr Tyr Gly Arg Pro Phe Ser He Thr Cys Leu Asp 

1730 " " 1735 1740 

Asn Leu Val Trp Ser Ser Pro Lys Asp Val Cys Lys Arg Lys Ser 

1745 1750 1755 

Cys Lys Thr Pro Pro Asp Pro Val Asn Gly Met Val His Val He. 

1760 1765 1770 

Thr Asp Cys Glu Met Gly His Gin Leu Arg Gly Ser Pro Glu Arg 

1775 1780 1785 

Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu Gin Pro Val Cys 

1790 1795 1800 

Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly Met 

1805 1810 * 1815 

He Val Ser Ser Asp Gly He Gin Val Gly Ser Arg He Asn Tyr 

1820 1825 1830 

Ser Cys Thr Thr Gly His Arg Leu He Gly His Ser Ser Ala Glu 

1835 1840 1845 
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Cys lie Leu Ser Gly Asn Ala Ala His Trp Ser Thr Lys Pro Pro 

1850 1855 1860 

lie Cys Gin Arg lie Pro Cys Gly Leu Pro Pro lie Leu Phe Ser 

1865 1870 " 1875 

Ser Ser Val lie Tyr Ala Cys Trp Glu Gly Tyr Lys Thr Ser Gly 

1880 1885 1890 

Leu Met Thr Arg. His Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr 

1895 1900 1905 

Ala Pro Asp Cys Thr lie lie Ser Cys Gly Asp Pro Gly Thr lie 

1910 1915 1920 

Ala Asn Gly Asp Phe He Ser Thr Asn Arg Glu Asn Phe His Tyr 

1925 1930 1935 

Gly Ser Val Val Thr Tyr Arg . Cys Asn Pro Gly Ser Gly Gly Arg 

1940 1945 1950 

Lys Val Phe Glu Leu Val Gly Glu Pro Ser He Tyr Cys Thr Ser 

1955 1960 1965 

Asn Asp Asp Thr Leu Ala Asn Gly He Gin Phe Gly Thr Asp Phe 

1970 1975 1980 

Thr Phe Asn Lys Thr Val Ser Tyr Gin Cys Asn Pro Gly Tyr Val 

1985 1990 1995 

Met Glu Ala Val Thr Ser Ala Thr He Arg Cys Thr Lys Asp Gin 

2000 2005 2010 

Val Gly He Trp Ser Gly Pro Ala Pro Gin Cys He Xaa Pro Asn 

2015 2020 2025 

Lys Cys Thr Pro Pro Asn Val Glu Asn Gly He Leu Val Ser Asp 

2030 2035 " 2040 

Asn Arg Ser Leu Phe Ser Leu Asn Glu Val Val Glu Phe Arg Cys 

2045 2050 2055 

Gin Pro Gly Phe Gly Arg Trp Asn Pro Ser Lys Pro Val Cys Lys 

2060 2065 2070 

Ala Val Leu Cys Pro Gin Pro Pro Pro Val Gin Asn Gly Thr Val 

2075 2080 2085 

Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser He Ser Tyr Ser Cys 

2090 2095 2100 

Met Asp Gly Tyr Val Met Lys Gly Pro Arg Arg Val Lys Cys Gin 

2105 2110 2115 

Ala Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg Val 

2120 2125 2130 

Cys Gin Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gin Arg 

2135 2140 2145 

Asp Lys Asp Asn Phe Ser Pro Gly Gin Leu Ser His Ser Ala He 

2150 2155 ~ 2160 
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Leu Ser Cys Glu Gly Arg Gly Val Trp Lys Gly Glu lie Pro Gin 
21?65 2170 2175 

Cys Leu Pro Val Phe Cys Gly Asp Pro Gly lie Pro Ala Glu Gly 
2180 2185 2190 

Arg Leu Ser Gly Lys Ser Phe Thr Tyr Lys Gin Glu Val Phe Tyr 
2195 2200 2205 

ser Cys Glu Pro Gly Tyr Asp Leu Arg Gly Ala Ala Ser Met Arg 
2210 2215 2220 

Cys Thr Pro Gin Gly Asp Trp Ser Pro Ala Ala Pro Thr Cys Glu 
2225 2230 223S 

Val Lys Ser Cys Asp Asp Phe Met Gly Gin Leu Leu Ser Glu Val 
2240 2245 2250 

Phe Phe Gin Cys Lys Ser Pro Phe lie Leu Val Gly Ser Ser Arg 
2255 2260 2265 

Arg Val Cys Gin Ala Asp Gly Thr Trp Ser Gly He Gin Pro Thr 
2270 ' 2275 2280 

Cys He Asp Pro Ala His Asn Thr Cys Pro Asp Pro Gly Thr Pro 
2285 2290 2295 

His Asn Gly Arg Val Leu Phe Pro Val Asn Leu Gin Leu Gly Ala 
2300 2305 2310 

Lys Val Asp Phe Val Cys Asp Glu Gly Phe Gin Leu Lys Gly Ser 
2315 " 2320 2325 

Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser Leu Trp Asn 
2330 2335 2340 

Ser Ser Val Pro Val Cys Phe Gly He Gin Asn Ser Ser Arg Gly 
2345 2350 2355 

Tyr Glu Val Gly Ser Thr Val Phe Phe Arg Cys Arg Lys Gly Tyr 
2360 2365 2370 

His He Gin Gly Ser Thr Thr Arg Thr Cys Leu Ala Asn Leu Thr 
2375 2380 2385 

Trp Ser Gly He Gin Thr Glu Cys Glu Gin lie Phe Cys Pro Ser 
2390 - 2395 2400 

Pro Pro Val He Pro Asn Gly Arg His Thr Gly Lys Pro Leu Glu 
2405 2410 2415 

Val Phe Pro Phe Gly Lys Ala Val Asn Tyr Thr Cys Asp Pro His 
2420 2425 2430 

Pro Asp Arg Gly Thr Ser Phe Asp Leu He Gly Glu Ser He Pro 
2435 2440 2445 

His Ala Cys Arg Gin Pro Glu Thr Pro Ala His Ala Asp Val Arg 
2450 * 2455 2460 

Ala He Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val Tyr Thr Cys 
2465 2470 2475 
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His Pro Gly Phe Phe Leu Ala Gly Gly Ser Thr lie Arg Cys Thr 
2480 2485 2490 

Ser Asp Pro Gin Gly Asn Gly Val Trp Ser Ser Pro Ala Pro Arg 
■~ 2495 2500 2505 

Cys Glu His Arg Thr Cys Lys Ala Asp Met Lys Trp Thr Gly Lys 
2510 2515 2520 

Ser Pro Val Cys 
2525 

<210> 6 

<211> ,10433 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1) . . (9300) 



<400> 6 

acc ctg acg gtt ggt gat get ggg aag gtg gga gac acc aga teg gtc 48 

Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val 

1 5 10 - • 15 

ttg tac gtg etc acg gga tec agt gtt cct gac etc att gtg age atg 96 
Leu Tyr Val Leu Thr Gly Ser Ser Val Pro Asp Leu lie Val Ser Met 
20 25 30 

age aac cag atg tgg eta cat ctg cag teg gat gat age att ggc tea 144 

Ser Asn Gin Met Trp Leu His Leu Gin Ser Asp Asp Ser lie Gly Ser 
35 40 45 

cct ggg ttt aaa get gtt tac caa gaa att gaa aag gga ggg tgt ggg 192 

Pro Gly Phe Lys Ala Val Tyr Gin Glu lie Glu Lys Gly Gly Cys Gly 
50 55 60 

gat cct gga ate ccc gee tat ggg aag egg acg ggc age agt ttc etc 240 

Asp Pro Gly lie Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu 
65 70 75 80 

ca t 99^ gat aca etc acc ttt gaa tgc ccg gcg gee ttt gag ctg gtg 288 

His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu Val 
85 90 95 

999 9 a 9 aga gtt ate acc tgt cag cag aac aat cag tgg tct ggc aac 336 

Gly Glu Arg Val lie Thr Cys Gin Gin Asn Asn Gin Trp Ser Gly Asn 
100 105 110 

aag ccc age tgt gta ttt tea tgt ttc ttc aac ttt acg gca tea tct 384 

Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser 
115 120 125 
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ggg att att ctg tea cca aat tat cca gag gaa tat ggg aac aac atg 432 
Gly lie lie Leu Ser Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met 
130 135 140 

aac tgt gtc tgg ttg att ate teg gag cca gga agt cga att cac eta 480 
Asn Cys Val Trp Leu lie He Ser Glu Pro Gly Ser Arg He His Leu 
145 150 155 160 

ate ttt aat gat ttt gat gtt gag cct caa ttt gac ttt etc gcg gtc 528 
He Phe Asn Asp Phe Asp Val Glu Pro Gin Phe Asp Phe Leu Ala Val 
165 170 .175 

aag gat gat ggc att tct gac ata act gtc ctg ggt act ttt tct ggc 576 
Lys Asp Asp Gly lie Ser Asp He Thr Val Leu Gly Thr Phe Ser Gly 
180 185 190 

aat gaa gtg cct tec cag ctg gee age agt ggg cat ata gtt cgc ttg 624 
Asn Glu Val Pro Ser Gin Leu Ala Ser Ser Gly His He Val Arg Leu 
195 200 205 

gaa ttt cag tct gac cat tec act act ggc aga ggg ttc aac ate act 672 
Glu Phe Gin Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn He Thr 
210 215 220 

tac acc aca ttt ggt cag aat gag tgc cat gat cct ggc att cct ata 720 
Tyr Thr Thr Phe Gly Gin Asn Glu Cys His Asp Pro Gly He Pro He 
225 230 235 240 

aac gga cga cgt ttt ggt gac agg ttt eta etc ggg age teg gtt tct 768 
Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser 
245 250 255 

ttc cac tgt gat gat ggc ttt gtc aag acc cag gga tec gag tec att 816 
Phe His Cys Asp Asp Gly Phe Val Lys Thr Gin Gly Ser Glu Ser He 
I 260 265 270 

acc tgc ata ctg caa gac ggg aac gtg gtc tgg age tec acc gtg ccc 864 
Thr Cys He Leu Gin Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro 
275 280 285 

cgc tgt gaa get cca tgt ggt gga cat ctg aca gcg tec age gga gtc 912 
Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val 
290 295 300 

att ttg cct cct gga tgg cca gga tat tat aag gat tct tta cat tgt 960 
He Leu Pro Pro Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys 
305 310 315 320 

gaa tgg ata att gaa gca aaa cca ggc cac tct ate aaa ata act ttt 1008 
Glu Trp He He Glu Ala Lys Pro Gly His Ser He Lys He Thr Phe 
325 330 335 

gac aga ttt cag aca gag gtc aat tat gac acc ttg gag gtc aga gat 1056 
Asp Arg Phe Gin Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp 
340 345 350 

ggg cca gee agt teg tec cca ctg ate ggc gag tac cac ggc acc cag 1104 
Gly Pro Ala Ser Ser Ser Pro Leu He Gly Glu Tyr His Gly Thr Gin 
355 360 365 

gca ccc cag ttc etc ate age acc ggg aac ttc atg tac ctg eta ttc 1152 
Ala Pro Gin Phe Leu He Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe 
370 375 380 
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acc act gac aac age cgc tec age ate ggc ttc etc ate cac tat gag 1200 

Thr Thr Asp Asn Ser Arg Ser Ser lie Gly Phe Leu He His Tyr Glu 
385 390 395 400 

agt gtg acg ctt gag teg gat tec tgc ctg gac ccg ggc ate cct gtg 1248 
Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly He Pro Val 
405 410 415 

aac grc cat cgc cac ggt gga gac ttt ggc ate agg tec aca gtg act 1296 
Asn Xaa His Arg His Gly Gly Asp Phe Gly He Arg Ser Thr Val Thr 
420 425 430 

ttc age tgt gac ccg ggg tac aca eta agt gac gac gag ccc etc gtc 1344 
Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val 
435 440 4.45 

tgt gag agg aac cac cag tgg aac cac gec ttg ccc age tgc gac get 1392 
Cys Glu Arg Asn His Gin Trp Asn His Ala Leu Pro Ser Cys Asp Ala 
450 455 . 460 

eta tgt gga ggc tac ate caa ggg aag agt gga aca gtc ctt tct cct 1440 
Leu Cys Gly Gly Tyr He Gin Gly Lys Ser Gly Thr Val Leu Ser Pro 
465 ~ 470 475 480 

ggg ttt cca gat ttt tat cca aac tct eta aac ygc acg tgg ace att 1488 
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Xaa Thr Trp Thr He 
485 490 • 495 

gaa gtg tct cat ggg aaa gga gtt caa atg ate ttt cac ace ttt cat 1536 
Glu Val Ser His Gly Lys Gly Val Gin Met He Phe His Thr Phe His 
500 * * 505 510 

ctt gag agt tec cac gac tat tta ctg ate aca gag gat gga agt ttt 1584 
Leu Glu Ser Ser His . Asp Tyr Leu Leu He Thr Glu Asp Gly Ser Phe 
515 520 525 

tec gag ccc gtt gee agg etc acc ggg teg gtg ttg cct cat acg ate 1632 
Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr He 
530 535 540 

aag gca ggc ctg ttt gga aac ttc act gee cag ctt egg ttt ata tea 1680 
Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gin Leu Arg Phe He Ser 
545 550 555 560 

gac ttc tea att teg tac gag ggc ttc aat ate aca ttt tea gaa tat 1728 
Asp Phe Ser He Ser Tyr Glu Gly Phe Asn He Thr Phe Ser Glu Tyr 
565 570 575 

gac ctg gag cca tgt gat gat cct gga gtc cct gee ttc age cga aga 1776 
Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe Ser Arg Arg 
580 " " 585 590 

att ggt ttt cac ttt ggt gtg gga gac tct ctg acg ttt tec tgc ttc 1824 
He Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr Phe Ser Cys Phe 
595 600 605 

ctg gga tat cgt tta gaa ggt gee rcc aag ctt acc tgc ctg ggt ggg 1872 
Leu Gly Tyr Arg Leu Glu Gly Ala Xaa Lys Leu Thr Cys Leu Gly Gly 
610 615 ~ 620 

ggc cgc cgt gtg tgg agt gca cct ctg cca agg tgt gtg gee gaa tgt 1920 
Gly Arg Arg Val Trp Ser Ala Pro Leu Pro Arg Cys Val Ala Glu Cys 
625 630 635 640 
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gga gca agt gtc aaa gga aat gaa gga aca tta ctg tct cca aat ttt 1968 

Gly Ala Ser Val Lys Gly Asn Glu Gly Thr Leu Leu Ser Pro Asn Phe 
645 650 655 

cca tec aat tat gat aat aac cat gag tgt ate tat aaa ata gaa aca 2016 
Pro Ser Asn Tyr Asp Asn Asn His Glu Cys He Tyr Lys He Glu Thr 
660 665 670 

gaa gec ggc aag ggc ate cac ctt aga aca cga age ttc cag ctg ttt 2064 
Glu Ala Gly Lys Gly He His Leu Arg Thr Arg Ser Phe Gin Leu Phe 
675 680 685 

gaa gga gat act eta aag gta tat gat gga aaa gac agt tec tea cgt 2112 
Glu Gly Asp Thr Leu Lys Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg 
690 695 700 

cca ctg ggc acg ttc act aaa aat gaa ctt ctg ggg ctg ate eta aac 2160 
Pro Leu Gly Thr Phe Thr Lys Asn Glu Leu Leu Gly Leu He Leu Asn 
705 710 715 720 

age aca tec aat cac ctr tgg eta gag ttc aac ace aat gga tct gac 2208 
Ser Thr Ser Asn His Xaa Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp 
725 730 735 

acc gac caa ggt ttt caa etc acc tat acc agt ttt gat ctg gta aaa 2256 
Thr Asp Gin Gly Phe Gin Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys 
740 745 750 

tgt gag gat ccg ggc ate cct aac tac ggc tat agg ate cgt gat gaa 2304 
Cys Glu Asp Pro Gly He Pro Asn Tyr Gly Tyr Arg He Arg Asp Glu 
755 760 765 

ggc cac ttt acc gac act gta gtt ctg tac agt tgc aac ccg ggg tac 2352 
Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr 
770 775 780 

gee atg cat ggc age aac acc ctg acc tgt ttg agt gga gac agg aga 2400 
Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu Ser Gly Asp Arg Arg 
785 790 795 800 

gtg tgg gac aaa cca eta cct teg tgc ata gcg gaa tgt ggt ggt cag 2448 
Val Trp Asp Lys Pro Leu Pro Ser Cys He Ala Glu Cys Gly Gly Gin 
805 810 815 

ate cat gca gee aca tea gga cga ata ttg tec cct ggc tat cca get 2496 
He His Ala Ala Thr Ser Gly Arg He Leu Ser Pro Gly Tyr Pro Ala 
820 825 830 

ccg tat gac aac aac etc cac tgc acc tgg att ata gag gca gac cca 2544 
Pro Tyr Asp Asn Asn Leu His Cys Thr Trp He He Glu Ala Asp Pro 
835 840 845 

gga aag acc att age etc cat ttc att gtt ttc gac acg gag atg get 2592 
Gly Lys Thr He Ser Leu His Phe He Val Phe Asp Thr Glu Met Ala 
850 855 860 

cac gac ate etc aag gtc tgg gac ggg ccg gtg gac agt gac ate ctg 2 64 0 

His Asp He Leu Lys Val Trp Asp Gly Pro Val Asp Ser Asp He Leu 
865 870 875 880 

ctg aag gag tgg agt ggc tec gee ctt ccg gag gac ate cac age acc 2688 
Leu Lys Glu Trp Ser Gly Ser Ala Leu Pro Glu Asp He His Ser Thr 
885 890 895 
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ttc aac tea etc ace ctg cag ttc gac age gac ttc ttc ate age aag 
Phe Asn Ser Leu Thr Leu Gin Phe Asp Ser Asp Phe Phe lie Ser Lys 
900 905 910 



2736 



tct ggc ttc tec ate cag ttc tec acc tea att gca gee acc tgt aac 
Ser Gly Phe Ser lie Gin Phe Ser Thr Ser lie Ala Ala Thr Cys Asn 
915 920 925 



2784 



gat cca ggt atg ccc caa aat ggc acc cgc tat gga gac age aga gag 
Asp Pro Gly Met Pro Gin Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu 
930 935 940 



2832 



get gga gac acc gtc aca ttc cag tgt gac cct ggc tat cag etc caa 
Ala Gly Asp Thr Val Thr Phe Gin Cys Asp Pro Gly Tyr Gin Leu Gin 
945 950 955 960 



2880 



djga caa gee aaa ate acc tgt gtg cag ctg aat aac egg ttc ttt tgg 
Gly Gin Ala Lys lie Thr Cys Val Gin Leu Asn Asn Arg Phe Phe Trp 
965 970 975 



2928 



caa cca gac cct cct aca tgc ata get get tgt gga ggg aat ctg acg 
Gin Pro Asp Pro Pro Thr Cys lie Ala Ala Cys Gly Gly Asn Leu Thr 
980 985 990 



2976 



ggc cca gca ggt gtt att ttg tea ccc aac tao cca cag ccg tat cct 
Gly Pro Ala Gly Val lie Leu Ser Pro Asn Tyr Pro Gin Pro Tyr Pro 
995 1000 1005 



3024 



cct ggg aag gaa tgt gac tgg aga gta aaa gtg aac ccg gac ttt 
Pro Gly Lys Glu Cys Asp Trp Arg Val Lys Val Asn Pro Asp Phe 
1010 " 1015 1020 



3069 



gtc ate gee ttg ata ttc aaa agt ttc aac atg gag ccc age tat 
Val He Ala Leu He Phe Lys Ser Phe Asn Met Glu Pro Ser Tyr 
1025 1030 1035 



3114 



gac ttc eta cac ate tat gaa ggg gaa gat tec aac age ccc etc 
Asp Phe Leu His He Tyr Glu Gly Glu Asp Ser Asn Ser Pro Leu 
1040 1045 1050 



3159 



att ggg agt tac cag ggc tct cag gee cca gaa aga 
He Gly Ser Tyr Gin Gly Ser Gin Ala Pro Glu Arg 
1055 1060 1065 



ata gag agt 
He Glu Ser 



3204 



age gga aac age ctg ttt ctg 
Ser Gly Asn Ser Leu Phe Leu 
1070 1075 



gca ttt egg agt gat gee tec gtg 
Ala Phe Arg Ser Asp Ala Ser Val 
1080 



3249 



ggc ctt tea ggg ttc gec att gaa ttt aaa gag aaa cca egg gaa 
Gly Leu Ser Gly Phe Ala He Glu Phe Lys Glu Lys Pro Arg Glu 
1085 1090 1095 



3294 



get tgt ttt gac cca gga aat 
Ala Cys Phe Asp Pro Gly Asn 
1100 1105 



ata atg aat ggg aca 
He Met Asn Gly Thr 
1110 



aga gtt gga 
Arg Val Gly 



3339 



aca gac ttc aag ctt ggc tec acc ate acc tac cag 
Thr Asp Phe Lys Leu Gly Ser Thr He Thr Tyr Gin 
1115 1120 1125 



tgt gac tct 
Cys Asp Ser 



3384 



ggc tat aag att ctt gac ccc 
Gly Tyr Lys He Leu Asp Pro 
1130 1135 



tea tec ate acc tgt gtg att ggg 
Ser Ser lie Thr Cys Val He Gly 
1140 



3429 
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gct gat ggg aaa ccc tec tgg gac caa gtg ctg ccc tec tgc aat 3474 
ftla Asp Gly Lys Pro Ser Trp Asp Gin Val Leu Pro Ser Cys Asn 
1145 1150 1155 

get ccc tgt gga ggc cag tac acg gga tea gaa ggg gta gtt tta 3519 
Ala Pro Cys Gly Gly Gin Tyr Thr Gly Ser Glu Gly Val Val Leu 
1160 1165 1170 

tea cca aac tac ccc cat aat tac aca get ggt caa ata tgc etc 3564 
Ser Pro Asn Tyr Pro His Asn Tyr Thr Ala Gly Gin lie Cys Leu 
1175 1180 1185 

tat tec ate acg gta cca aag gaa ttc gtg gtc ttt gga cag ttt 3609 
Tyr Ser He Thr Val Pro Lys Glu Phe Val Val Phe Gly Gin Phe 
11,90 1195 1200 

gee tat ttc cag aca gee ctg aat gat ttg gca gaa tta ttt gat 3654 
Ala Tyr Phe Gin Thr Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp 
1205 1210 1215 

gga acc cat gca cag gee aga ctt etc age tea etc teg ggg tct 3699 
Gly Thr His Ala Gin Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser 
1220 1225 1230 

cac tea ggg gaa aca ttg ccc ttg get acg tea aat caa att ctg 3744 
His Ser Gly Glu Thr Leu Pro Leu Ala Thr Ser Asn Gin He Leu 
1235 1240 1245 

etc cga ttc agt gca aag age ggt gee tct gec cgc ggc ttc cac 3789 
Leu Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala Arg Gly Phe His 
1250 1255 1260 

ttc gtg tat caa get gtt cct cgt acc agt gac acc caa tgc age 3834 
Phe Val Tyr Gin Ala Val Pro Arg Thr Ser Asp Thr Gin Cys Ser 
1265 1270 1275 

tct gtc ccc gag ccc aga tac gga agg aga att ggt tct gag ttt 3879 
Ser Val Pro Glu Pro Arg Tyr Gly Arg Arg He Gly Ser Glu Phe 
1280 1285 1290 

tct gee ggc tec ate gtc cga ttc gag trc aac ccg gga tac ctg 3924 
Ser Ala Gly Ser He Val Arg Phe Glu Xaa Asn Pro Gly Tyr Leu 
1295 1300 1305 

ctt cag ggt tec acg gcg etc cac tgc cag tec gtg ccc aac gec 3969 
Leu Gin Gly Ser Thr Ala Leu His Cys Gin Ser Val Pro Asn Ala 
1310 1315 1320 

ttg gca cag tgg aac gac acg ate ccc age tgt gtg gta ccc tgc 4014 
Leu Ala Gin Trp Asn Asp Thr He Pro Ser Cys Val Val Pro Cys 
1325 1330 1335 

a 9t ggc aat ttc act caa cga aga ggt aca ate ctg tec ccc ggc 4059 
Ser Gly Asn Phe Thr Gin Arg Arg Gly Thr He Leu Ser Pro Gly 
1340 1345 1350 

tac cct gag cca tac gga aac aac ttg aac tgt ata tgg aag ate 4104 
Tyr Pro Glu Pro Tyr Gly Asn Asn Leu Asn Cys He Trp Lys He 
1355 1360 1365 

ata gtt acg gag ggc teg gga att cag ate caa gtg ate agt ttt 4149 
He Val Thr Glu Gly Ser Gly He Gin He Gin Val He Ser Phe 
1370 1375 1380 
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gcc acg gag cag aac tgg gac tec ctt gag ate cac 
Ala Thr Glu Gin Asn Trp Asp Ser Leu Glu lie His 
1385 1390 1395 



gat ggt ggg 
Asp Gly Gly 



4194 



gat gtg acc gca ccc aga ctg 
Asp Val Thr Ala Pro Arg Leu 
1400 1405 



gga age ttc tea ggc 
Gly Ser Phe Ser Gly 
1410 



acc aca gta 
Thr Thr Val 



4239 



ccg gca 
Pro Ala 
1415 



ctg ctg aac agt act 
Leu Leu Asn Ser Thr 
1420 



tec aac caa etc tac 
Ser Asn Gin Leu Tyr 
1425 



ctg cat ttc 
Leu His Phe 



4284 



cag tct gac att agt gtg gca 
Gin Ser Asp lie Ser Val Ala 
1430 1435 



get get ggt ttc cac 
Ala Ala Gly Phe His 
1440 



ctg gaa tac 
Leu Glu Tyr 



4329 



aaa act gta ggt ctt get gca tgc caa gaa cca gcc 
Lys Thr Val Gly Leu Ala Ala Cys Gin Glu Pro Ala 
1445 1450 1455 



etc ccc age 
Leu Pro Ser. 



4374 



aac age ate aaa ate gga gat egg tac atg gtg aac 
Asn Ser lie Lys He Gly Asp Arg Tyr Met Val Asn 
1460 1465 1470 



gac gtg etc 
Asp Val Leu 



4419 



tec ttc cag tgc gag ccc ggg 
Ser Phe Gin Cys Glu Pro Gly 
1475 1480 



tac acc ctg cag ggc cgt tec cac 
Tyr Thr Leu Gin Gly Arg Ser His 
1485 



4464 



att tec tgt atg cca ggg acc 
He Ser Cys Met Pro Gly Thr 
1490 1495 



gtt cgc cgt tgg aac tat ccg tct 
Val Arg Arg Trp Asn Tyr Pro Ser 
1500 



4509 



ccc ctg tgc att gca acc tgt gga ggg acg ctg age 
Pro Leu Cys He Ala Thr Cys Gly Gly Thr Leu Ser 
1505 1510 1515 



acc ttg ggt 
Thr Leu Gly 



4554 



ggt gtg ate ctg age ccc ggc 
Gly Val He Leu Ser Pro Gly 
1520 1525 



ttc cca ggt tct tac 
Phe Pro Gly Ser Tyr 
1530 



ccc aac aac 
Pro Asn Asn 



4599 



tta gac tgc acc tgg agg ate 
Leu Asp Cys Thr Trp Arg He 
1535 1540 



tea tta ccc ate ggc 
Ser Leu Pro He Gly 
1545 



tat ggt gca 
Tyr Gly Ala 



4644 



cat att cag ttt ctg aat ttt 
His He Gin Phe Leu Asn Phe 
1550 1555 



tct acc gaa get aat 
Ser Thr Glu Ala Asn 
1560 



cat gac ttc 
His Asp Phe 



4689 



ctt gaa att caa aat gga cct tac cac acc age ccc 
Leu Glu He Gin Asn Gly Pro Tyr His Thr Ser Pro 
1565 1570 1575 



atg att gga 
Met He Gly 



4734 



caa ttt age ggc acg gat etc 
Gin Phe Ser Gly Thr Asp Leu 
1580 1585 



ccc gcg gcc ctg ctg 
Pro Ala Ala Leu Leu 
1590 



age aca acg 
Ser Thr Thr 



4779 



cat gaa acc etc ate cac ttt tat agt gac cat teg caa aac egg 
His Glu Thr Leu He His Phe Tyr Ser Asp His Ser Gin Asn Arg 
1595 1600 1605 



4824 



caa gga ttt aaa ctt get tac 
Gin Gly Phe Lys Leu Ala Tyr 
1610 1615 



caa gcc tat gaa tta cag aac tgt 
Gin Ala Tyr Glu Leu Gin Asn Cys 
1620 



4869 
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cca gat cca ccc cca ttt cag aat ggg tac atg ate 
Pro Asp Pro Pro Pro Phe Gin Asn Gly Tyr Met lie 
1625 1630 1635 



aac teg gat 
Asn Ser Asp 



4914 



tac age gtg ggg caa tea gta 
Tyr Ser Val Gly Gin Ser Val 
1640 1645 



tct ttc gag tgt tat 
Ser Phe Glu Cys Tyr 
1650 



cct ggg tac 
Pro Gly Tyr 



4959 



att eta ata ggc cat cct gtc etc act tgt cag cat ggg ate aac 
lie Leu lie Gly His Pro Val Leu Thr Cys Gin His Gly lie Asn 
1655 1660 1665 



5004 



aga aac tgg aac tac cct ttt 
Arg Asn Trp Asn Tyr Pro Phe 
1670 ~ 1675 



cca aga tgt gat gee cct tgt ggg 
Pro Arg Cys Asp Ala Pro Cys Gly 
1680 



. 5049 



tac aac gta act tct cag aac ggc acc ate tac tec 
Tyr Asn Val Thr Ser Gin Asn Gly Thr lie Tyr Ser 
1685 1690 1695 



cct ggc ttt 
Pro Gly Phe 



5094 



cct gat gag tat ccg ate ctg aag gac tgc att tgg 
Pro Asp Glu Tyr Pro lie Leu Lys Asp Cys He Trp 
1700 1705 1710 



etc ate acg 
Leu He Thr 



5139 



gtg cct cca ggg cac gga gtt tac ate aac ttc acc 
Val Pro Pro Gly His Gly Val Tyr He Asn Phe Thr 
1715 1720 1725 



ctg tta cag 
Leu Leu Gin 



5184 



acg gaa get gtc aac gat tac 
Thr Glu Ala Val Asn Asp Tyr 
1730 1735 



att get gtt tgg gac 
He Ala Val Trp Asp 
1740 



ggt ccc gat 
Gly Pro Asp 



5229 



cag aac tea ccc cag ctg gga 
Gin Asn Ser Pro Gin Leu Gly 
1745 1750 



gtt ttc agt ggc aac 
Val Phe Ser Gly Asn 
1755 



aca gee etc 
Thr Ala Leu 



5274 



gaa acg gcg tat age tec acc aac caa gtc ctg etc 
Glu Thr Ala Tyr Ser Ser Thr Asn Gin Val Leu Leu 
1760 1765 1770 



aag ttc cac 
Lys Phe His 



5319 



a 9c gac ttt tea aat gga ggc ttc ttt gtc etc aat 
Ser Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn 
1775 1780 1785 



ttc cac gca 
Phe His Ala 



5364 



ttt cag etc aag aaa tgt caa 
Phe Gin Leu Lys Lys Cys Gin 
1790 1795 



cct ccc cca gcg gtt 
Pro Pro Pro Ala Val 
1800 



cca cag gca 
Pro Gin Ala 



5409 



gaa atg ctt act gag gat gat gat ttc gag ata gga 
Glu Met Leu Thr Glu Asp Asp Asp Phe Glu lie Gly 
1805 1810 1815 



gat ttt gtg 
Asp Phe Val 



5454 



aag tac cag tgc cac ccc ggg tac acc ttg gtg ggg acc gac att 
Lys Tyr Gin Cys His Pro Gly Tyr Thr Leu Val Gly Thr Asp He 
1820 1825 1830 



5499 



ctg act tgc aag etc agt tec cag ttg cag ttt gag 
Leu Thr Cys Lys Leu Ser Ser Gin Leu Gin Phe Glu 
1835 1840 1845 



ggt tct etc 
Gly Ser Leu 



5544 



cca aca tgt gaa gca caa tgc cca gca aat gaa gtc 
Pro Thr Cys Glu Ala Gin Cys Pro Ala Asn Glu Val 
1850 1855 1860 



egg act gga 
Arg Thr Gly 



5589 
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tea teg gga gtc att etc agt cca ggg tat ccg ggt aat tat ttt 

Ser Ser Gly Val lie Leu Ser Pro Gly Tyr Pro Gly Asn Tyr Phe 
1865 1870 1875 

aac tec cag act tgc tct tgg agt att aaa gtg gaa cca aac tac 

Asn Ser Gin Thr Cys Ser Trp Ser He Lys Val Glu Pro Asn Tyr 
1880 1885 1890 



5634 



5679 



aac att acc ate ttt gtg gac aca ttt caa agt gaa 
Asn He Thr lie. Phe Val Asp Thr Phe Gin Ser Glu 
1895 1900 1905 



aag cag ttt 
Lys Gin Phe 



gat gca ctg gaa gtg ttt gat 
Asp Ala Leu Glu Val Phe Asp 
1910 1915 



ggt tct tct ggg caa agt cct ctg 
Gly Ser Ser Gly Gin Ser Pro Leu 
1920 



5724 



5769 



eta gta gtc tta agt ggg aat cat act gaa caa tea aat ttt aca 
Leu Val Val Leu Ser Gly Asn His Thr Glu Gin Ser Asn Phe Thr 
1925 1930 1935 



age agg agt aat cag tta tat 
Ser Arg Ser Asn Gin Leu Tyr 
1940 1945 



etc cgc tgg tec act gac cat gee 
Leu Arg Trp Ser Thr Asp His Ala 
1950 



acc agt aag aaa gga ttc aag att cgc tat gca gca cct tac tgc 

Thr Ser Lys Lys Gly Phe Lys He Arg Tyr Ala Ala Pro Tyr Cys 
1955 I960 1965 

agt ttg acc cac ccc ctg aag aat ggg ggt att eta aac agg act 

Ser Leu Thr His Pro Leu Lys Asn Gly Gly He Leu Asn Arg Thr 
1970 1975 1980 

gca gga gcg gtt gga age aaa gtg cat tat ttt tgc aag cct gga 

Ala Gly Ala Val Gly Ser Lys Val His Tyr Phe Cys Lys Pro Gly 
1985 1990 1995 

tac cga atg gtc ggc cac age aat gca acc tgt aga cga aac cca 

Tyr Arg Met Val Gly His Ser Asn Ala Thr Cys Arg Arg Asn Pro 
2000 2005 2010 

ctt ggc atg tac cag tgg gac tec etc acg cca etc tgc cag get 

Leu Gly Met Tyr Gin Trp Asp Ser Leu Thr Pro Leu Cys Gin Ala 
2015 2020 2025 



5814 



5859 



5904 



5949 



5994 



6039 



6084 



gtg tec tgt gga ate cca gaa tec cca gga aac ggt 
Val Ser Cys Gly lie Pro Glu Ser Pro Gly Asn Gly 
2030 2035 2040 



tea ttt acc 
Ser Phe Thr 



6129 



ggg aac gag ttc act ttg gac agt aaa gtg gtc tat gaa tgt cat 

Gly Asn Glu Phe Thr Leu Asp Ser Lys Val Val Tyr Glu Cys His 
2045 2050 2055 

gag ggc ttc aag ctt gaa tec age cag caa gca aca gee gtg tgt 

Glu Gly Phe Lys Leu Glu Ser Ser Gin Gin Ala Thr Ala Val Cys 
2060 2065 2070 

caa gaa gat ggg ctg tgg agt aac aag ggg aag ccg ccc acg tgt 

Gin Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro Pro Thr Cys 
2075 2080 2085 

aag ccg gtc get tgc ccc age att gaa get cag etc tea gaa cat 

Lys Pro Val Ala Cys Pro Ser He Glu Ala Gin Leu Ser Glu His 
2090 2095 2100 



6174 



6219 



6264 



6309 
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gtc ate tgg agg ctg gtt tea 
Vcfcl He Trp Arg Leu Val Ser 
2105 2110 



gga tec ttg aat gag 
Gly Ser Leu Asn Glu 
2115 



tac ggt get 
Tyr Gly Ala 



6354 



caa gta ttg ctg age tgc agt cct ggt tac tac tta 
Gin Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu 
2120 2125 " 2130 



gaa ggc tgg 
Glu Gly Trp 



6399 



agg etc ctg egg tgc cag gec aat ggg acg tgg aac ata gga gat 
Arg Leu Leu Arg Cys Gin Ala Asn. Gly Thr Trp Asn He Gly Asp 
2135 2140 2145 



6444 



gag agg cca age tgt cga gtt 
Glu Arg Pro Ser Cys Arg Val 
2150 2155 



ate teg tgt gga age 
He Ser Cys Gly Ser 
2160 



ctt tec ttt 
Leu Ser Phe 



6489 



ccc cca aat ggc aac aag att gga acg ttg aca gtt tat ggg gee 
Pro Pro Asn Gly Asn Lys He Gly Thr Leu Thr Val Tyr Gly Ala 
2165 , 2170 2175 



6534 



aca get ata ttt acg tgc aac acc ggc tac acg ctt gtg ggg tct 
Thr Ala He Phe Thr Cys Asn Thr Gly Tyr Thr Leu Val Gly Ser 
2180 2185 2190 



6579 



cat gtc aga gag tgc ttg gca aat ggg etc tgg age ggc age gaa 
His Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser Gly Ser Glu 
2195 2200 2205 



6624 



act cga tgt ctg get ggc cac 
Thr Arg Cys Leu Ala Gly His 
2210 2215 



t9c 99t tec cca gac ccg att gtg 
Cys Gly Ser Pro Asp Pro He Val 
2220 



6669 



aac 99t cac att agt gga gat ggc ttc agt tac aga gac acg gtg 
Asn Gly His He Ser Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val 
2225 2230 2235 



6714 



gtt tac cag tgc aat cct ggt 
Val Tyr Gin Cys Asn Pro Gly 
2240 2245 



ttc egg ctt gtg gga act tec gtg 
Phe Arg Leu Val Gly Thr Ser Val 
2250 



6759 



agg ata tgc ctg caa gac cac aag tgg tct gga caa acg cct gtc 
Arg He Cys Leu Gin Asp His Lys Trp Ser Gly Gin Thr Pro Val 
2255 2260 ~ 2265 



6804 



tgt gtc ccc ate aca tgt ggt 
Cys Val Pro He Thr Cys Gly 
2270 2275 



cac cct gga aac cct gee cac gga 
His Pro Gly Asn Pro Ala His Gly 
2280 



6849 



ttc act aat ggc agt gag ttc aac ctg aat gat gtc gtg aat ttc 
Phe Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val Val Asn Phe 
2285 2290 2295 



6894 



acc tgc aac acg ggc tat ttg ctg cag ggc gtg tct cga gee cag 
Thr Cys Asn Thr Gly Tyr Leu Leu Gin Gly Val Ser Arg Ala Gin 
2300 2305 2310 



6939 



tgt egg age aac ggc . cag tgg 
Cys Arg Ser Asn Gly Gin Trp 
2315 2320 



a 9t age cct ctg ccc acg tgt cga 
Ser Ser Pro Leu Pro Thr Cys Arg 
2325 



6984 



gtg gtg aac tgt tct gat cca 
Val Val Asn Cys Ser Asp Pro 
2330 2335 



ggc ttt gtg gaa aat gec att cgt 
Gly Phe Val Glu Asn Ala He Arg 
2340 



7029 
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cac ggg caa cag aac ttc cct gag agt ttt gag tat 
HiB Gly Gin Gin Asn Phe Pro Glu Ser Phe Glu Tyr 
2345 2350 2355 



ate ctg tac cat tgc aag aag 
lie Leu Tyr His Cys Lys Lys 
2360 2365 



gga ttt tac ttg ctg 
Gly Phe Tyr Leu Leu 
2370 



gec ttg acc tgt atg gca aat ggc tta tgg gac cga 
Ala Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg 
2375 2380 2385 



gga atg agt 
Gly Met Ser 



gga tct tea 
Gly Ser Ser 



tec ctg ccc 
Ser Leu Pro 



aag tgt ttg get ata teg tgt 
Lys Cys Leu Ala He Ser Cys 
2390 2395 



gga cac cca ggg gtc cct gec aac 
Gly His Pro Gly Val Pro Ala Asn 
2400 



gec gtc 
Ala Val 
2405 

tac tec 
Tyr Ser 
2420 

gtg tgc 
Val Cys 
2435 



etc act gga gag ctg ttt acc tat ggc gee gtc gtg cac 
Leu Thr Gly Glu Leu Phe Thr Tyr Gly Ala Val Val His 
2410 2415 

tgc aga ggg age gag age etc ata ggc aac gac acg aga 
Cys Arg Gly Ser Glu Ser Leu He Gly Asn Asp Thr Arg 
2425 2430 



cag gaa gac agt cac 
Gin Glu Asp Ser His 
2440 



tgg age ggg gca ctg ccc cac tgc 
Trp Ser Gly Ala Leu Pro His Cys 
2445 



aca gga aat aat cct gga ttc 
Thr Gly Asn Asn Pro Gly Phe 
2450 2455 



tgt ggt gat ccg ggg 
Cys Gly Asp Pro Gly 
2460 



acc cca gca 
Thr Pro Ala 



cat ggg tct egg ctt ggt gat gac ttt aag aca aag agt ctt etc 
His Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys Ser Leu Leu 
2465 2470 2475 

cgc ttc tec tgt gaa atg ggg cac cag ctg agg ggc tec cct gaa 
Arg Phe Ser Cys Glu Met Gly His Gin Leu Arg Gly Ser Pro Glu 
2480 " 2485 2490 



cgc acg 
Arg Thr 
2495 



tgt ttg etc aat ggg tea tgg tea gga ctg cag ccg gtg 
Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu Gin Pro Val 
2500 . 2505 



tgt gag gee gtg tec tgt ggc aac cct ggc aca ccc acc aac gga 
Cys Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly 
2510 2515 2520 

atg att gtc agt agt gat ggc att ctg ttc tec age teg gtc ate 
Met He Val Ser Ser Asp Gly He Leu Phe Ser Ser Ser Val He 
2525 2530 2535 



tat gec tgc tgg gaa ggc tac 
Tyr Ala Cys Trp Glu Gly Tyr 
2540 2545 



aag acc tea ggg etc atg aca egg 
Lys Thr Ser Gly Leu Met Thr Arg 
2550 



cat tgc aca gee aat ggg acc tgg aca ggc act get 
His Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala 
2555 2560 2565 



ccc gac tgc 
Pro Asp Cys 



aca att ata agt tgt ggg gat 
Thr He He Ser Cys Gly Asp 
2570 2575 



cca ggc aca eta gca aat ggc ate 
Pro Gly Thr Leu Ala Asn Gly He 
2580 



7074 



7119 



7164 



7209 



7254 



7299 



7344 



7389 



7434 



7479 



7524 



7569 



7614 



7659 



7704 



7749 
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cag ttt ggg acc gac ttc acc ttc aac aag act gtg age tat cag 7794 
Gin Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val Ser Tyr Gin 
2585 2590 2595 

tgt aac cca ggc tat gtc atg gaa gca gtc aca tec gec act att 7839 
Cys Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser Ala Thr lie 
2600 2605 2610 

cgc tgt acc aaa gac ggc agg tgg aat ccg age aaa cct gtc tgc 7884 
Arg Cys Thr Lys. Asp Gly Arg Trp Asn Pro Ser Lys Pro Val Cys 
2615 2620 2625 

aaa gec gtg ctg tgt cct cag ccg ccg ccg gtg cag aat gga aca 7929 
Lys Ala Val Leu Cys Pro Gin Pro Pro Pro Val Gin Asn Gly Thr 
2630 2635 2640 

gtg gag gga agt gat ttc cgc tgg ggc tec age ata agt tac age 7974 
Val Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser lie Ser Tyr Ser 
2645 2650 2655 

tgc atg gac ggt tac cag etc tct cac tec gee ate etc tec tgt 8019 
Cys Met Asp Gly Tyr Gin Leu Ser His Ser Ala lie Leu Ser Cys 
2660 2665 2670 

gaa ggt cgc ggg gtg tgg aaa gga gag ate ccc cag tgt etc cct 8064 
Glu Gly Arg Gly Val Trp Lys Gly Glu lie Pro Gin Cys Leu Pro 
2675 2680 2685 

gtg ttc tgc gga gac cct ggc ate ccc gca gaa ggg cga ctt agt 8109 
Val Phe Cys Gly Asp Pro Gly lie Pro Ala Glu Gly Arg Leu Ser 
2690 2695 2700 

ggg aaa agt ttc acc tat aag tec gaa gtc ttc ttc cag tgc aaa 8154 
Gly Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe Gin Cys Lys 
2705 2710 2715 

tct cca ttt ata etc gtg gga tec tec aga aga gtc tgc caa get 8199 
Ser Pro Phe lie Leu Val Gly Ser Ser Arg Arg Val Cys Gin Ala 
2720 2725 2730 

gac ggc acg tgg age ggc ata caa ccc acc tgc att. gat cct get 8244 
Asp Gly Thr Trp Ser Gly lie Gin Pro Thr Cys lie Asp Pro Ala 
2735 2740 2745 

cat aac acc tgc cca gac cct ggt acg cca cac ttt gga ata cag 8289 
His Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe Gly lie Gin 
2750 2755 2760 

aat age tec aga ggc tat gag gtt gga age acg gtt ttt ttc agg 8334 
Asn Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val Phe Phe Arg 
2765 2770 2775 

tgc aga aaa ggc tac cat att caa ggt tec acg act cgc acc tgc 8379 
Cys Arg Lys Gly Tyr His lie Gin Gly Ser Thr Thr Arg Thr Cys 
2780 2785 2790 

ctt gee aat tta aca tgg agt ggg ata cag acc gaa tgt ata cct 8424 
Leu Ala Asn Leu Thr Trp Ser Gly He Gin Thr Glu Cys He Pro 
2795 2800 2805 

cat gee tgc aga cag cca gaa acc ccg gca cac gcg gat gtg aga 8469 
His Ala Cys Arg Gin Pro Glu Thr Pro Ala His Ala Asp Val Arg 
2810 2815 2820 
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gcc ate gat ctt cct act ttc ggc tac acc tta gtg tac acc tgc 
Ala lie Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val Tyr Thr Cys 
2825 ~ 2830 2835 



cat cca ggc ttt ttc etc gca 
His Pro Gly Phe Phe Leu Ala 
2840 2845 



999 99 a tct 9 a 9 cac a 9 a aca t 9 t 
Gly Gly Ser Glu His Arg Thr Cys 
2850 



aaa gca gac atg aaa tgg aca gga aag teg cct gtg tgt aaa agt 
Lys Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val Cys Lys Ser 
2855 2860 2865 



aaa gga gtg aga gaa gtt aat 
Lys Gly Val Arg Glu Val Asn 
2870 2875 



gaa aca gtt act aaa act cca gtt 
Glu Thr Val Thr Lys Thr Pro Val 
2880 



cct tea gat gtc ttt ttc gtc aat tea ctg tgg aag ggg tat tat 
Pro Ser Asp Val Phe Phe Val Asn Ser Leu Trp Lys Gly Tyr Tyr 
2885 2890 2895 

gaa tat tta- ggg aaa aga caa ccc gcc act eta act gtt gac tgg 
Glu Tyr Leu Gly Lys Arg Gin Pro Ala Thr Leu Thr Val Asp Trp 
2900 2905 2910 



ttc aat gca aca age agt aag gtg aat gcc acc ttc 
Phe Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe 
2915 2920 2925 

teg cca gtg gag ctg aag ttg aca ggc att tac aag 
Ser Pro Val Glu Leu Lys Leu Thr Gly He Tyr Lys 
2930 2935 2940 



age gaa gcc 
Ser Glu Ala 



aag gag gag 
Lys Glu Glu 



gcc cac tta etc ctg aaa get ttt caa att aaa ggc cag gca gat 
Ala His Leu Leu Leu Lys Ala Phe Gin He Lys Gly Gin Ala Asp 
2945 2950 2955 



att ttt gta age aag ttc gaa 
He Phe Val Ser Lys Phe Glu 
2960 2965 



aat gac aac tgg gga 'eta gat ggt 
Asn Asp Asn Trp Gly Leu Asp Gly 
2970 



tat gtg 
Tyr Val 
2975 



tea tct gga ctt gaa aga gga gga ttt act 
Ser Ser Gly Leu Glu Arg Gly Gly Phe Thr 
2980 2985 



ttt caa ggt 
Phe Gin Gly 



gac att cat gga aaa gac ttt gga aaa ttt aag eta gaa agg caa 
Asp He His Gly Lys Asp Phe Gly Lys Phe Lys Leu Glu Arg Gin 
2990 2995 3000 



gat cct tta aac cca gat caa gac tct tec agt cat 
Asp Pro Leu Asn Pro Asp Gin Asp Ser Ser Ser His 
3005 3010 3015 



tac cac ggc 
Tyr His Gly 



acc age agt ggc tct gtg gcg get gcc att ctg gtt cct ttc ttt 

Thr Ser Ser Gly Ser Val Ala Ala Ala He Leu Val Pro Phe Phe 
3020 * 3025 3030 

get eta att tta tea ggg ttt gca ttt tac etc tac aaa cac aga 

Ala Leu He Leu Ser Gly Phe Ala Phe Tyr Leu Tyr Lys His Arg 
3035 3040 3045 

acg aga cca aaa gtt caa tac aat ggc tat get ggg cat gaa aac 

Thr Arg Pro Lys Val Gin Tyr Asn Gly Tyr Ala Gly His Glu Asn 
3050 3055 3060 



8514 



8559 



8604 



8649 



8694 



8739 



8784 



8829 



8874 



8919 



8964 



9009 



9054 



9099 



9144 



9189 
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age aat gga caa gca teg ttt gaa aac ccc atg tat gat aca aac 9234 

Ser Asn Gly Gin Ala Ser Phe Glu Asn Pro Met Tyr Asp Thr Asn 
3065 3070 3075 

tta aaa ccc aca gaa gec aag get gtg agg ttt gac aca act ctg 9279 

Leu Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp Thr Thr Leu 
3080 3085 3090 

aac aca gtc tgt aca gtg gta tagccctcag tgccccaaca ggactgattc 9330 

Asn Thr Val Cys Thr Val Val 
3095 3i00 



atagecatae 


ctctgatgga 


caagcagtga 


ttcctttggt 


gccatatacc actctcccyt 


.9390 


ccactctggc 


tttactgcag 


cgatcttcaa 


ccttgtctac 


tggcataagt geagegggga 


9450 


tctctactca 


aatgtgtcag 


ggtcttctac 


ggatcaaact 


acacatgegt tttcattcca 


9510 


aaagtgggtt 


etaaatgect 


ggctgeatet gtatgaaatc aaggcacact ccaggaagac 


9570 


tgccacgtcg 


cgccaacacg 


tcatactcaa 


trcctcagac 


tttcatattt ctgtgttgct 


9630 


gagatgeett 


teaatgeaat 


cgtctgggct 


cgtggatatg 


tccctcaggt gcggtgacag 


9690 


aatggtggca 


ccacgatatg 


tgttctcttg 


tgttgttttt 


cctttttaaa cccccatgaa 


9750 


cacgaatact 


ctgaaaaaaa 


taaaaagctt 


tctggaagaa 


gacacctttc tgatagaggc 


9810 


tcacacctac 


aaatgettea 


ctctgtcctt 


ccgagacctg 


acaagctttg aggacctcac 


9870 


agctcccctg 


tgtgttcatc 


tctagggatg 


tttgcaattt 


cccagtcagc tgttctgtcg 


9930 


cagaatgttt 


aatgcacaat 


tttttgeact 


agtgtgttat 


gaatgactaa gattctgata 


9990 


aaaaaaataa 


attatttaca 


cagggtttat 


acacactatc 


cattgtatat aagcattatt 


10050 


tcatattatc 


aagctaaaca 


ttcccccatc 


agcttagttg gagtgttagg gaaaagtatt 


10110 


cctagatatg 


gcacagattt 


taaaaggaaa 


tacagtattg 


acgagattta ttttattatt 


10170 


gcttcaatta 


gctccattta 


cgtgttgaat 


tcattgaaga 


ggtccaatga gaaaaaaaca 


10230 


gaagcctcct 


tatttcacac 


gttttcctcc 


tttagtacca 


tcctcatcca attactgtct 


10290 


ctctgatact 


acttaatagc 


agggggtttg 


cagaaatttc 


tgtttgccat gtaaaactgt 


10350 


gaatagtaat 


ttattttaga 


tagtcgatga 


acttgtgggt 


tttagctcac aatgeagect 


10410 


tcccttttgc 


agtgtttttt 


ttt 






10433 



<210> 7 

<211> 3100 

<212> PRT 

<213> Homo sapiens 



<400> 7 
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Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val 
1 5 10 15 

Leu Tyr Val Leu Thr Gly Ser Ser Val Pro Asp Leu lie Val Ser Met 
20 25 30 

Ser Asn Gin Met Trp Leu His Leu Gin Ser Asp Asp Ser lie Gly Ser 
35 40 45 

Pro Gly Phe Lys Ala Val Tyr Gin Glu lie Glu Lys Gly Gly Cys Gly 
50 55 60 

Asp Pro Gly lie Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu 
65 70 75 80 

His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu Val 
85 90 95 

Gly Glu Arg Val He Thr Cys Gin Gin Asn Asn Gin Trp Ser Gly Asn 
100 105 110 

Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser 
115 120 125 

Gly He He Leu Ser Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met 
130 135 140 

Asn Cys Val Trp Leu He He Ser Glu Pro Gly Ser Arg He His Leu 
145 150 155 160 

He Phe Asn Asp Phe Asp Val Glu Pro Gin Phe Asp Phe Leu Ala Val 
165 170 175 

Lys Asp Asp Gly He Ser Asp He Thr Val Leu Gly Thr Phe Ser Gly 
180 185 190 

Asn Glu Val Pro Ser Gin Leu Ala Ser Ser Gly His He Val Arg Leu 
195 200 205 

Glu Phe Gin Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn He Thr 
210 215 220 

Tyr Thr Thr Phe Gly Gin Asn Glu Cys His Asp Pro Gly He Pro He 
225 230 235 240 



Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser 
245 250 255 
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Phe His Cys Asp Asp Gly Phe Val Lys Thr Gin Gly Ser Glu Ser lie 

: as- ; 260 265 270 



Thr Cys lie Leu Gin Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro 
275 280 285 



Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val 
— 290 295 300 



lie Leu tro fro Gly urp fro Gly uyr ryr .uys Asp Ser Leu His uys 
305 310 315 320 



Glu Trp lie lie Glu Ala Lys Pro Gly His Ser lie Lys lie Thr Phe 
325 330 335 



Asp Arg Phe Gin Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp 
340 345 350 

Gly Pro Ala Ser Ser Ser Pro Leu lie Gly Glu Tyr His Gly Thr Gin 
355 360 " 365 



Ala Pro Gin Phe Leu He Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe 
370 375 380 



Thr Thr Asp Asn Ser Arg Ser Ser He Gly Phe Leu He His Tyr Glu 
385 390 395 400 



Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly He Pro Val 
405 410 415 



Asn Xaa His Arg His Gly Gly Asp Phe Gly He Arg Ser Thr Val Thr 
420 425 430 

Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val 
435 440 445 



Cys Glu Arg Asn His Glh Trp Asn His Ala Leu Pro Ser Cys Asp Ala 
450 455 460 



Leu Cys Gly Gly Tyr He Gin Gly Lys Ser Gly Thr Val Leu Ser Pro 
465 470 475 480 

Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Xaa Thr Trp Thr He 
485 490 495 

Glu Val Ser His Gly Lys Gly Val Gin Met He Phe His Thr Phe His 
500 505 510 
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Leu Glu Ser Ser His Asp Tyr Leu Leu He Thr Glu Asp Gly Ser Phe 
\ 515 520 525 

Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr lie 
530 535 540 

* »* • i 

Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gin Leu Arg Phe He Ser 
5#5 " 550 555 560 

Asp Phe Ser He Ser Tyr Glu Gly Phe Asn He Thr Phe Ser Glu Tyr 
565 570 575 

Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe Ser Arg Arg 
580 585 590 

lie Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr Phe Ser Cys Phe 
. 595 600 605 

Leu Gly Tyr Arg Leu Glu Gly Ala Xaa Lys Leu Thr Cys Leu Gly Gly 
61.0 615 620 

Gly Arg Arg Val Trp Ser Ala Pro Leu Pro Arg Cys Val Ala Glu Cys 
625 630 635 640 

Gly Ala Ser Val Lys Gly Asn Glu Gly Thr Leu Leu Ser Pro Asn Phe 
645 650 655 

Pro Ser Asn Tyr Asp Asn Asn His Glu Cys He Tyr Lys He Glu Thr 
660 665 670 

Glu Ala Gly Lys Gly He His Leu Arg Thr Arg Ser Phe Gin Leu Phe 
675 680 685 

Glu Gly Asp Thr Leu Lys Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg 
690 ~ 695 700 

Pro Leu Gly Thr Phe Thr Lys Asn Glu Leu Leu Gly Leu He Leu Asn 
705 710 715 720 

Ser Thr Ser Asn His Xaa Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp 
725 730 735 

Thr Asp Gin Gly Phe Gin Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys 
740 745 750 



Cys Glu Asp Pro Gly He Pro Asn Tyr Gly Tyr Arg He Arg Asp Glu 
755 760 765 
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Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr 
770 775 780 



Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu Ser Gly Asp Arg Arg 
785 790 795 800 



Val Trp Asp Lys Pro Leu Pro Ser Cys He Ala Glu Cys Gly Gly Gin 
$05 810 815 



He His Ala Ala Thr Ser Gly Arg He Leu Ser Pro Gly Tyr Pro Ala 
820 825 830 



Pro Tyr Asp Asn Asn Leu His Cys Thr Trp He He Glu Ala Asp Pro 
835 840 845 



Gly Lys Thr He Ser Leu His Phe He Val Phe Asp Thr Glu Met Ala 
850 855 860 



His Asp He Leu Lys Val Trp Asp Gly Pro Val Asp Ser Asp He Leu 
865 870 - 875 880 



Leu Lys Glu Trp Ser Gly Ser Ala Leu Pro Glu Asp He His Ser Thr 
885 890 895 

Phe Asn Ser Leu Thr Leu Gin Phe Asp Ser Asp Phe Phe He Ser Lys 
900 905 910 

Ser Gly Phe Ser He Gin Phe Ser Thr Ser He Ala Ala Thr Cys Asn 
915 920 925 

Asp Pro Gly Met Pro Gin Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu 
930 935 ~ ^ 940 

Ala Gly Asp Thr Val Thr Phe Gin Cys Asp Pro Gly Tyr Gin Leu Gin 
945 950 955 960 

Gly Gin Ala Lys He Thr Cys Val Gin Leu Asn Asn Arg Phe Phe Trp 
965 970 975 

Gin Pro Asp Pro Pro Thr Cys lie Ala Ala Cys Gly Gly Asn Leu Thr 
980 985 990 

Gly Pro Ala Gly Val He Leu Ser Pro Asn Tyr Pro Gin Pro Tyr Pro 
995 1000 1005 



Pro Gly Lys Glu Cys Asp Trp Arg Val Lys Val Asn Pro Asp Phe 
1010 1015 1020 
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Val He Ala Leu He Phe Lys Ser Phe Asn Met Glu Pro Ser Tyr 
1025 1030 1035 

Asp Phe Leu His He Tyr Glu Gly Glu Asp Ser Asn Ser Pro Leu 
1040 1045 1050 

lie Gly Ser Tyr Gin Gly Ser Gin Ala Pro Glu Arg He Glu Ser 
1055 1060 1065 

Ser Gly Asn Ser Leu Phe Leu Ala Phe Arg Ser Asp Ala Ser Val 
1070 1075 1080 

Gly Leu Ser Gly Phe Ala He Glu Phe Lys Glu Lys Pro Arg Glu 
1085 1090 1095 

Ala Cys Phe Asp Pro Gly Asn He Met Asn Gly Thr Arg Val Gly 
1100 1105 1110 

Thr Asp Phe Lys Leu Gly Ser Thr He Thr Tyr Gin Cys Asp Ser 
1115 1120 1125 

Gly Tyr Lys He Leu Asp Pro Ser Ser He Thr Cys Val He Gly 
1130 1135 1140 

Ala Asp Gly Lys Pro Ser Trp Asp Gin Val Leu Pro Ser Cys Asn 
1145 ~ " 1150 1155 

Ala Pro Cys Gly Gly Gin Tyr Thr Gly Ser Glu Gly Val Val Leu 
1160 ~ 1165 1170 

Ser Pro Asn Tyr Pro His Asn Tyr Thr Ala Gly Gin lie Cys Leu 
1175 1180 1185 

Tyr Ser He Thr Val Pro Lys Glu Phe Val Val Phe Gly Gin Phe 
1190 1195 1200 

Ala Tyr Phe Gin Thr Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp 
1205 1210 1215 

Gly Thr His Ala Gin Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser 
1220 1225 1230 

His Ser Gly Glu Thr Leu Pro Leu Ala Thr Ser Asn Gin He Leu 
1235 1240 1245 

Leu Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala Arg Gly Phe His 
1250 1255 1260 
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Phe Val Tyr Gin Ala Val Pro Arg Thr Ser Asp Thr Gin Cys Ser 
12$5 1270 1275 



Ser Val Pro Glu Pro Arg Tyr Gly Arg Arg He Gly Ser Glu Phe 
1280 1285 1290 



Ser Ala Gly Ser He Val Arg Phe Glu Xaa Asn Pro Gly Tyr Leu 
1295 1300 1305 



Leu uin Gly ser Thr Ala Leu His Cys Gin Ser Val Pro Asn Ala 
1310 1315 1320 



Leu Ala Gin Trp Asn Asp Thr He Pro Ser Cys Val Val Pro Cys 
1325 1330 1335 



Ser Gly Asn Phe Thr Gin Arg Arg Gly Thr He Leu Ser Pro Gly 
1340 1345 1350 



Tyr Pro Glu Pro Tyr Gly Asn Asn Leu Asn Cys He Trp Lys He 
1355 1360 1365 



He Val Thr Glu Gly Ser Gly He Gin He Gin Val He Ser Phe 
1370 1375 1380 



Ala Thr Glu Gin Asn Trp Asp Ser Leu Glu He His Asp Gly Gly 
1385 1390 1395 



Asp Val Thr Ala Pro Arg Leu Gly Ser Phe Ser Gly Thr Thr Val 
1400 1405 1410 



Pro Ala Leu Leu Asn Ser Thr Ser Asn Gin Leu Tyr Leu His Phe 
1415 1420 1425 



Gin Ser Asp He Ser Val Ala Ala Ala Gly Phe His Leu Glu Tyr 
1430 1435 1440 



Lys Thr Val Gly Leu Ala Ala Cys Gin Glu Pro Ala Leu Pro Ser 
1445 1450 1455 



Asn Ser He Lys He Gly Asp Arg Tyr Met Val Asn Asp Val Leu 
1460 1465 ' 1470 



Ser Phe Gin Cys Glu Pro Gly Tyr Thr Leu Gin Gly Arg Ser His 
1475 1480 1485 



He Ser Cys Met Pro Gly Thr Val Arg Arg Trp Asn Tyr Pro Ser 
1490 1495 1500 
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Pro Leu Cys lie Ala Thr Cys Gly Gly Thr Leu Ser Thr Leu Gly 
1505 1510 1515 

Gly Val lie Leu Ser Pro Gly Phe Pro Gly Ser Tyr Pro Asn Asn 
1520 1525 1530 

lieu Asp Cys Thr Trp Arg lie Ser Leu Pro lie Gly Tyr Gly Ala 
1535 1540 1545 

His lie Gin Phe Leu Asn Phe Ser Thr Glu Ala Asn His Asp Phe 
1550 1555 1560 

Leu Glu lie Gin Asn Gly Pro Tyr His Thr Ser Pro Met lie Gly 
1565 1570 1575 

Gin Phe Ser Gly Thr Asp Leu Pro Ala Ala Leu Leu Ser Thr Thr 
1580 1585 1590 

His Glu Thr Leu lie His Phe Tyr Ser Asp His Ser Gin Asn Arg 
1595 1600 1605 

Gin Gly Phe Lys Leu Ala Tyr Gin Ala Tyr Glu Leu Gin Asn Cys 
1610 "* 1615 1620 

Pro Asp Pro Pro Pro Phe Gin Asn Gly Tyr Met lie Asn Ser Asp 
1625 1630 1635 

Tyr Ser Val Gly Gin Ser Val Ser Phe Glu Cys Tyr Pro Gly Tyr 
1640 1645 1650 

He Leu He Gly His Pro Val Leu Thr Cys Gin His Gly He Asn 
1655 1660 1665 

Arg Asn Trp Asn Tyr Pro Phe Pro Arg Cys Asp Ala Pro Cys Gly 
1670 1675 1680 

Tyr Asn Val Thr Ser Gin Asn Gly Thr He Tyr Ser Pro Gly Phe 
1685 1690 1695 

Pro Asp Glu Tyr Pro He Leu Lys Asp Cys He Trp Leu He Thr 
1700 1705 1710 

Val Pro Pro Gly His Gly Val Tyr He Asn Phe Thr Leu Leu Gin 
1715 1720 1725 



Thr Glu Ala Val Asn Asp Tyr He Ala Val Trp Asp Gly Pro Asp 
1730 1735 1740 
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Gin Asn Ser Pro Gin Leu Gly Val Phe Ser Gly Asn Thr Ala Leu 
17£5 1750 1755 

Glu Thr Ala Tyr Ser Ser Thr Asn Gin Val Leu Leu Lys Phe His 
1760 1765 1770 



Sex Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn Phe His Ala 
1775 1780 1785 



fhe vain Leu Lys Lys cys uln Pro Pro Pro Ala val Pro Gin Ala 
1790 1795 1800 

Glu Met Leu Thr Glu Asp Asp Asp Phe Glu lie Gly Asp Phe Val 
1805 1810 1815 



Lys Tyr Gin Cys His Pro Gly Tyr Thr Leu Val Gly Thr Asp lie 
1820 1825 1830 



Leu Thr Cys LyB Leu Ser Ser Gin Leu Gin Phe Glu Gly Ser Leu 
1835 1840 1845 



Pro Thr Cys Glu Ala Gin Cys Pro Ala Asn Glu Val Arg Thr Gly 
1850 1855 1860 



Ser Ser Gly Val lie Leu Ser Pro Gly Tyr Pro Gly Asn Tyr Phe 
1865 1870 1875 



Asn Ser Gin Thr Cys Ser Trp Ser lie Lys Val Glu Pro Asn Tyr 
1880 1885 ~ 1890 



Asn lie Thr lie Phe Val Asp Thr Phe Gin Ser Glu Lys Gin Phe 
1895 1900 1905 



Asp Ala Leu Glu Val Phe Asp Gly Ser Ser Gly Gin Ser Pro Leu 
1910 1915 * 1920 



Leu Val Val Leu Ser Gly Asn His Thr Glu Gin Ser Asn Phe Thr 
1925 1930 1935 



Ser Arg Ser Asn Gin Leu Tyr Leu Arg Trp Ser Thr Asp His Ala 
1940 1945 1950 



Thr Ser Lys Lys Gly Phe Lys lie Arg Tyr Ala Ala Pro Tyr Cys 
1955 1960 1965 



Ser Leu Thr His Pro Leu Lys Asn Gly Gly lie Leu Asn Arg Thr 
1970 1975 " 1980 
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Ala Gly Ala Val Gly Ser Lys Val His Tyr Phe Cys Lys Pro Gly 
1985 1990 1995 

Tyr Arg Met Val Gly His Ser Asn Ala Thr Cys Arg Arg Asn Pro 
2000 2005 2010 

*■ **■ i 

Leu Gly Met Tyr Gin Trp Asp Ser Leu Thr Pro Leu Cys Gin Ala 
2015 2020 2025 

Val Ser Cys Gly He Pro Glu Ser Pro Gly Asn Gly Ser Phe Thr 
2030 2035 2040 

Gly Asn Glu Phe Thr Leu Asp Ser Lys Val Val Tyr Glu Cys His 
2045 2050 2055 

Glu Gly Phe Lys Leu Glu Ser Ser Gin Gin Ala Thr Ala Val Cys 
2060 2065 2070 

Gin Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro Pro Thr Cys 
2075 " 2080 2085 

Lys Pro Val Ala Cys Pro Ser He Glu Ala Gin Leu Ser Glu His 
2090 2095 2100 

Val He Trp Arg Leu Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala 
2105 2110 2115 

Gin Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu Glu Gly Trp 
> 2120 2125 2130 

Arg Leu Leu Arg Cys Gin Ala Asn Gly Thr Trp Asn He Gly Asp 
2135 ~ 2140 2145 

Glu Arg Pro Ser Cys Arg Val He Ser Cys Gly Ser Leu Ser Phe 
2150 2155 2160 

Pro Pro Asn Gly Asn Lys He Gly Thr Leu Thr Val Tyr Gly Ala 
2165 2170 2175 

Thr Ala He Phe Thr Cys Asn Thr Gly Tyr Thr Leu Val Gly Ser 
2180 2185 2190 

His Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser Gly Ser Glu 
2195 2200 2205 



Thr Arg Cys Leu Ala Gly His Cys Gly Ser Pro Asp Pro He Val 
2210 2215 2220 
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Asn Gly His lie Ser Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val 
2225 ~ 2230 2235 



Val Tyr Gin Cys Asn Pro Gly Phe Arg Leu Val Gly Thr Ser Val 
2240 2245 2250 



Arg lie Cys Leu Gin Asp His Lys Trp Ser Gly Gin Thr Pro Val 
2255 ~ 2.260 " 2265 



uys Vai Pro ±ie xnr eys lily His Pro Gly Asn Pro Ala His Gly 
2270 2275 2280 



Phe Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val Val Asn Phe 
2285 2290 2295 



Thr Cys Asn Thr Gly Tyr Leu Leu Gin Gly Val Ser Arg Ala Gin 
2300 2305 2310 



Cys Arg Ser Asn Gly Gin Trp Ser Ser Pro Leu Pro Thr Cys Arg 
2315 2320 2325 



Val Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn Ala lie Arg 
2330 2335 2340 



His Gly Gin Gin Asn Phe Pro Glu Ser Phe Glu Tyr Gly Met Ser 
2345 2350 2355 



lie Leu Tyr His Cys Lys Lys Gly Phe Tyr Leu Leu Gly Ser Ser 
2360 2365 2370 



Ala Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg Ser Leu Pro 
2375 2380 2385 



Lys Cys Leu Ala lie Ser Cys Gly His Pro Gly Val Pro Ala Asn 
2390 2395 2400 



Ala Val Leu Thr Gly Glu Leu . Phe Thr Tyr Gly Ala Val Val His 
2405 2410 2415 



Tyr Ser Cys Arg Gly Ser Glu Ser Leu lie Gly Asn Asp Thr Arg 
2420 2425 2430 



Val Cys Gin Glu Asp Ser His Trp Ser Gly Ala Leu Pro His Cys 
2435 2440 " 2445 



Thr Gly Asn Asn Pro Gly Phe Cys Gly Asp Pro Gly Thr Pro Ala 
2450 2455 2460 
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His Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys Ser Leu Leu 
2465 2470 2475 

Arg Phe Ser Cys Glu Met Gly His Gin Leu Arg Gly Ser Pro Glu 
2480 2485 2490 

Arg Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu Gin Pro Val 
2495 2500 2505 

Cys Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly 
2510 2515 2520 

Met lie Val Ser Ser Asp Gly lie Leu Phe Ser Ser Ser Val lie 
2525 2530 2535 

Tyr Ala Cys Trp Glu Gly Tyr Lys Thr Ser Gly Leu Met Thr Arg 
2540 2545 2550 

His Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala Pro Asp Cys 
2555 2560 2565 

Thr lie lie Ser Cys Gly Asp Pro Gly Thr Leu Ala Asn Gly He 
2570 2575 2580 

Gin Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val Ser Tyr Gin 
2585 2590 2595 

Cys Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser Ala Thr He 
2600 2605 2610 

Arg Cys Thr Lys Asp Gly Arg Trp Asn Pro Ser Lys Pro Val Cys 
2615 2620 2625 

Lys Ala Val Leu Cys Pro Gin Pro Pro Pro Val Gin Asn Gly Thr 
2630 2635 2640 

Val Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser He Ser Tyr Ser 
2645 2650 2655 

Cys Met Asp Gly Tyr Gin Leu Ser His Ser Ala He Leu Ser Cys 
2660 & 2665 2670 

Glu Gly Arg Gly Val Trp Lys Gly Glu He Pro Gin Cys Leu Pro 
2675 2680 2685 



Val Phe Cys Gly Asp Pro Gly He Pro Ala Glu Gly Arg Leu Ser 
2690 2695 2700 
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Gly Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe Gin Cys Lys 
2705 2710 2715 



Ser Pro Phe lie Leu Val Gly Ser Ser Arg Arg Val Cys Gin Ala 
2720 2725 2730 



Asp Gly Thr Trp Ser Gly lie Gin Pro Thr Cys lie Asp Pro Ala 
— . 2735 2740 2745 



His Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe Gly lie Gin 
2750 2755 2760 



Asn Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val Phe Phe Arg 
2765 2770 2775 



Cyjs Arg Lys Gly Tyr His lie Gin Gly Ser Thr Thr Arg Thr Cys 
2780 2785 2790 



Leu Ala Asn Leu Thr Trp Ser Gly lie Gin Thr Glu Cys lie Pro 
2795 2800 2805 



His Ala Cys Arg Gin Pro Glu Thr Pro Ala His Ala Asp Val Arg 
2810 2815 2820 



Ala lie Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val Tyr Thr Cys 
2825 2830 2835 



His Pro Gly Phe Phe Leu Ala Gly Gly Ser Glu His Arg Thr Cys 
2840 2845 2850 



Lys Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val Cys Lys Ser 
2855 2860 "* 2865 



Lys Gly Val Arg Glu Val Asn Glu Thr Val Thr Lys Thr Pro Val 
2870 2875 2880 



Pro Ser Asp Val Phe Phe Val Asn Ser Leu Trp Lys Gly Tyr Tyr 
2885 2890 2895 



Glu Tyr Leu Gly Lys Arg Gin Pro Ala Thr Leu Thr Val Asp Trp 
2900 2905 2910 



Phe Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe Ser Glu Ala 
2915 2920 2925 



Ser Pro Val Glu Leu Lys Leu Thr Gly lie Tyr Lys Lys Glu Glu 
2930 2935 ~ * 2940 
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Ala His Leu Leu Leu Lys Ala Phe Gin lie Lys Gly Gin Ala Asp 
2945 2950 2955 

He Phe Val Ser Lys Phe Glu Asn Asp Asn Trp Gly Leu Asp Gly 
2960 2965 2970 

Tyr Val Ser Ser Gly Leu Glu Arg Gly Gly Phe Thr Phe Gin Gly 
• 2975 2980 2985 

Asp He His Gly Lys Asp Phe Gly Lys Phe Lys Leu Glu Arg Gin 
2990 2995 3000 

Asp Pro Leu Asn Pro Asp Gin Asp Ser Ser Ser His Tyr His Gly 
3005 3010 3015 

Thr Ser Ser Gly Ser Val Ala Ala Ala He Leu Val Pro Phe Phe 
3020 3025 3030 

Ala Leu He Leu Ser Gly Phe Ala Phe Tyr Leu Tyr Lys His Arg 
3035 3040 3045 

Thr Arg Pro Lys Val Gin Tyr Asn Gly Tyr Ala Gly His Glu Asn 
3050 3055 3060 

Ser Asn Gly Gin Ala Ser Phe Glu Asn Pro Met Tyr Asp Thr Asn 
3065 ~ 3070 ' 3075 

Leu Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp Thr Thr Leu 
3080 3085 3090 



Asn Thr Val Cys Thr Val Val 
3095 3100 
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